Principal Data Engineer

Hybrid
$192,500 – $275,000
United States

Tech Stack

Python, SQL, ETL

Job Description, Responsibilities & Requirements

About the Position

Gemini is seeking a Principal Data Engineer to lead data architecture and transformation efforts on its Data Team. This role is based in New York, New York, and requires in-person presence twice a week at the Gemini office.

About the Company

Gemini is a global crypto and Web3 platform founded by Cameron and Tyler Winklevoss in 2014. Our mission is to unlock the next era of financial, creative, and personal freedom by providing trusted access to the decentralized future. We envision a world where crypto reshapes the global financial system, internet, and money to create greater choice, independence, and opportunity for all.

The Department: Data

At Gemini, our Data Team is the engine that powers insight, innovation, and trust across the company. We bring together world-class data engineers, platform engineers, machine learning engineers, analytics engineers, and data scientists, all working together to transform raw information into secure, reliable, and actionable intelligence.

The Role: Principal Data Engineer

As a Principal Data Engineer, you will set the technical direction for how data is modeled, processed, and delivered across the organization. You will partner closely with product, analytics, ML, finance, operations, and engineering teams to move, transform, and model data reliably, with observability, resilience, and agility.

Responsibilities

  • Define and drive the long-term vision for data architecture, modeling, and transformation at Gemini
  • Establish standards for data reliability, observability, and quality across all pipelines and data products built with languages and frameworks such as Python, SQL, Spark, Flink, Beam, or equivalents
  • Partner with Staff and Senior Data Engineers, Platform Engineers, and Analytics Engineers to unify how data is produced, stored, and consumed
  • Lead large-scale design initiatives that span multiple teams and systems, ensuring maintainability, performance, and security
  • Partner with data scientists, ML engineers, analysts, and product teams to understand data requirements, define SLAs, and deliver coherent data products that others can self-serve
  • Establish data quality, validation, observability, and monitoring frameworks (data auditing, alerting, anomaly detection, data lineage)
  • Investigate and resolve complex production issues: root cause analysis, performance bottlenecks, data integrity, fault tolerance
  • Mentor and guide junior and mid-level data engineers by leading code reviews and design reviews and by advocating best practices
  • Help recruit and onboard new talent, shaping the future of Gemini’s data engineering discipline
  • Stay up to date on new tools, technologies, and patterns in the data and cloud space, bringing proposals and proofs of concept when appropriate
  • Document data flows, data dictionaries, architecture patterns, and operational runbooks

Requirements

  • 10+ years of experience in data engineering (or similar) roles
  • Strong experience in ETL/ELT pipeline design, implementation, and optimization
  • Deep expertise in Python and SQL, writing production-quality, maintainable, testable code
  • Experience with large-scale data warehouses (e.g. Databricks, BigQuery, Snowflake)
  • Solid grounding in software engineering fundamentals, data structures, and systems thinking
  • Hands-on experience in data modeling (dimensional modeling, normalization, schema design)
  • Experience building systems with real-time or streaming data (e.g. Kafka, Kinesis, Flink, Spark Streaming), and familiarity with CDC frameworks
  • Experience with orchestration / workflow frameworks (e.g. Airflow)
  • Familiarity with data governance, lineage, metadata, cataloging, and data quality practices
  • Strong cross-functional communication skills; ability to translate between technical and non-technical stakeholders
  • Proven experience in recruiting, mentoring, leading design discussions, and influencing data-engineering best practices across teams

Preferred Qualifications

  • Experience with crypto, financial services, trading, markets, or exchange systems
  • Experience with blockchain, crypto, and Web3 data (e.g. blocks, transactions, contract calls, token transfers, UTXO/account models, on-chain indexing, chain APIs)
  • Experience with infrastructure as code, containerization, and CI/CD pipelines
  • Hands-on experience managing and optimizing Databricks on AWS

We Offer

  • Competitive starting salary
  • A discretionary annual bonus
  • Long-term incentive in the form of a new hire equity grant
  • Comprehensive health plans
  • 401(k) with company matching
  • Paid Parental Leave
  • Flexible time off

Salary Range

The base salary range for this role is $192,500 to $275,000 in the State of New York, the State of California, and the State of Washington. This range is not inclusive of our discretionary bonus or equity package.

Work Location

This role requires a hybrid work approach at our hub offices, balancing the benefits of in-person collaboration with the flexibility of remote work. Expectations may vary by location and role, so candidates are encouraged to connect with their recruiter to learn more about the specific policy for the role.

Diversity and Inclusion

At Gemini, we strive to build diverse teams that reflect the people we want to empower through our products, and we are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or Veteran status. Equal Opportunity is the Law, and Gemini is proud to be an equal opportunity workplace. If you have a specific need that requires accommodation, please let a member of the People Team know.

Job Details

Company name: Gemini Espresso
Salary: $192,500 – $275,000
Location: United States
Employment Type: Full-time
Work Mode: Hybrid
Posted on TheJob: 6/13/2026
Last checked: 6/13/2026