Principal Software Engineer, Data Engineering
TLDR
Architect a high-scale data platform powering analytics and AI agents, shaping data strategy and bridging upstream data producers with downstream analytics and AI teams.
Architect the data platform – drive the technical direction for a scalable, reliable data platform built on a medallion architecture that serves customer-facing analytics, reporting, and agentic AI from a unified foundation.
Build and optimize ingestion pipelines – design robust CDC, real-time streaming (Kafka, Flink), and batch processing pipelines that transform complex, nested document-oriented operational data into clean analytical models at enterprise scale.
Tame schema complexity – build resilient ingestion and transformation layers that gracefully handle deeply nested, continuously evolving document schemas — deciding where to absorb complexity (ingestion, transformation, or query time) and making those tradeoffs explicit and sustainable.
Serve AI and analytics consumption patterns – architect data products that support both traditional BI workloads (pre-aggregated dashboards, dimensional models for scorecards and reports) and emerging AI consumption patterns (low-latency retrieval, contextual assembly, freshness-sensitive agent queries).
Own data quality, contracts, and observability – establish the data trust infrastructure that makes cross-team data consumption reliable: schema contracts with upstream producers, data quality monitoring, lineage tracking, freshness SLAs, and clear escalation paths when things break.
Drive cost-aware architecture – own Snowflake warehouse optimization, compute governance, and cost-efficient pipeline design. Build the practices and visibility so the team makes principled cost/performance tradeoffs rather than discovering them on the invoice.
Bridge producers and consumers – collaborate across organizational boundaries to align upstream software engineering teams and downstream analytics and AI teams around unified data strategies, shared contracts, and engineering standards.
Lead and grow the team – technically lead and growth-coach a diverse crew of data engineers. Champion best practices across the full spectrum of data engineering disciplines, from low-level pipeline architecture to sophisticated data modeling and analytical query performance.
-
Demonstrated depth in building production data platforms that serve multiple consumption patterns – you've gone beyond traditional BI to support real-time product features, AI/ML workloads, or customer-facing analytics from the same data foundation.
-
Deep experience with the impedance mismatch between document-oriented operational stores and analytical systems – you've dealt with nested, schema-evolving source data (MongoDB, DynamoDB, or similar) and have opinions on where flattening and transformation should live.
-
Hands-on experience with data quality and trust at scale – you've built or operated schema registries, data contracts, quality monitoring, or lineage systems in an environment where multiple teams depend on shared data products.
-
Track record of cost-conscious data architecture – you've optimized Snowflake (or comparable) warehouse spend, designed compute governance policies, or re-architected pipelines to materially reduce cost without sacrificing reliability.
-
Strong instinct for the bridge role: you're as comfortable pushing back on an upstream team's schema change as you are negotiating freshness SLAs with a downstream AI consumer.
-
8+ years of professional software engineering experience, with significant time spent on distributed, data-intensive production systems – including substantial depth in data pipeline and platform architecture.
-
Deep hands-on expertise with modern data technologies: Snowflake, Apache Kafka, Apache Flink, and CDC tooling (Debezium or similar).
-
Experience developing and operating cloud data infrastructure at enterprise scale (AWS preferred), including infrastructure-as-code (Terraform) and CI/CD automation.
-
Strong programming skills in Python, Java, and SQL. You write production-grade code, not just scripts.
-
A track record of designing performant data models that support fast, efficient querying for analytical and product-facing use cases.
- Strong cross-functional communication skills - you work effectively with software engineers, data scientists, AI teams, and business stakeholders across organizational boundaries.
-
Experience mentoring engineers and building collaborative, high-performing teams.
Foundations:
Base salary range: $188,696 - $258,391
Employees may also be eligible for bonuses and other forms of compensation.
The above represents total expected compensation for this role. Actual compensation will depend on various job-related factors, including, but not limited to, location, experience, and job qualifications.
Highspot also offers the following employee benefits for this position:
-Comprehensive medical, dental, vision, disability, and life benefits
-Health Savings Account (HSA) with employer contribution
-401(k) Matching with immediate vesting on employer match
-Flexible PTO
-8 paid holidays and 5 paid days for Annual Holiday Week
-Quarterly Recharge Fridays (paid days off for mental health recharge)
-18 weeks paid parental leave
-Access to Coaches and Therapists through Modern Health
-2 volunteer days per year
-Commuting benefits
#LI-DL1
Benefits
Equity Compensation
401(k) Matching with immediate vesting on employer match
Health Insurance
Health Savings Account (HSA) with employer contribution
commuting benefits
Paid Parental Leave
18 weeks paid parental leave
Paid Time Off
2 volunteer days per year
Wellness Stipend
Access to Coaches and Therapists through Modern Health
Highspot is a leading software product company that specializes in sales enablement, leveraging AI and GenAI technologies within its SaaS platform to enhance sales productivity. Designed for sales teams, the platform provides intelligent content management, customer engagement tools, and actionable analytics, transforming the way organizations operate and empowering users to achieve their goals.
- Founded
- Founded 2012
- Employees
- 500+ employees
- Industry
- Internet Software & Services
- Total raised
- $400M raised