Posted 8d ago (Jun 1, 25)
Data Engineer
United States$124.227k - $157.001kFull-timePythonSQLPostgresqlClickHouseDagsterApache SparkDaskDockerSlurm
We are looking for a Data Engineer who specializes in managing and optimizing data pipelines, with a specific focus on AI/ML data and applications. The ideal candidate will have experience in designing and implementing data infrastructure, ensuring high-quality labeled datasets, and collaborating with cross-functional teams to support our computer vision initiatives.
Responsibilities
- Manage data releases, tracking data provenance, evaluating data quality, providing data insight reporting.
- Contribute to scalable data and labeling pipelines for computer vision projects.
- Manage and optimize data storage solutions to ensure efficient data retrieval and processing.
- Monitor and troubleshoot data pipeline issues, ensuring timely resolution and minimal disruption to project timelines.
- Collaborate with machine learning engineers and other data/infrastructure engineers to understand data requirements and ensure the availability of high-quality labeled datasets.
- Stretch: Develop and maintain tools and processes for data labeling, including annotation workflows, quality control, and validation procedure.
Requirements
- Bachelor’s in Computer Science, Data Engineering, and 2+ years of experience in a related field.
- Strong Python and SQL; Good software development fundamentals a must.
- Experience as a Data Engineer, working with data pipelines.
- Proficiency with:
- Data modeling
- ETL processes and data pipeline operations
- OLTP databases (e.g. Postgresql)
- OLAP databases (e.g. ClickHouse)
- Excellent problem-solving skills and attention to detail.
- Strong communication skills and the ability to work effectively in a collaborative team environment.
Strong Plus
- Experience with evaluation of ML models.
- Experience developing evaluation frameworks.
Nice to Have
- Data pipelines for computer vision applications.
- Data labeling.
- Workflow orchestration tools, particularly Dagster.
- Distributed computing frameworks (Apache Spark, Dask).
- Containerization technologies (e.g. Docker).
- Job scheduling systems (e.g. Slurm).
Salary and compensation
$124,227 - $157,001 per yearBenefits
health insurancepaid time offholidays401(k)Flexible Spending AccountHealth Savings AccountAirbus Employee Share Ownership Planflight traininghybrid work model with 3 days in office31 days per year remote work (including outside U.S.)
Apply for this positionAny feedback or want to report a concern?Help us maintain the quality of jobs posted on Nata in Data!Contact us
Similar jobs

ICONIQ
Associate Data and Analytics Engineer
United States$130k - $150k
SQLPythonTableauLookerdiscretionary bonushealth insurance+1 more

Middleby Marshal, Inc.
Data Engineer Intern
United States
Azure Data FactoryAzure SynapseMicrosoft FabricSQL+1 more

The Hanover Insurance Group
Data Engineer
United States
PythonSQLAzure servicesPower BIMedical, dental, vision, life, and disability insurance401K with a company match+7 more

Pattern
Data Engineer
United States
SQLRedshiftBigQuerySnowflakeUnlimited PTOPaid Holidays+19 more

Charles Schwab
Associate Data Engineer
United States
SQLETLHadoopMongoDB401(k) with company match and Employee stock purchase planPaid time for vacation, volunteering, and 28-day sabbatical after every 5 years of service for eligible positions+7 more

Abbvie
Data Engineer
United States$82.5k - $157.5k
SnowflakeAzure SQL DatabaseRedShiftPythonpaid time off (vacation, holidays, sick)medical/dental/vision insurance+8 more

Concentrix Catalyst
Data Engineer
United States$130.541k - $137.186k
JavaMicroservicesMavenGitmedicaldental+5 more

Newmark
Data Engineer
United States$125k - $160k
QlikViewAlteryxPythonIndustry leading Parental Leave Policy (up to 16 weeks)Generous healthcare+5 more

Samsung Semiconductor
Intern Data Engineer
United States
C/C++PythonJIRA APIsRAGRelocation and housing stipends to support moving and living costs during the internship.Charitable giving match and opportunities for community involvement.+4 more

Garmin
Data Engineer 2
United States
AirflowKafkapySparkDocker+8 more

Lattice
Data Engineer
United States$123k - $154k
SQLdbtPythongitMedical insuranceDental insurance+14 more

Cargill
Data Engineer
United States
PythonSnowflake DatabaseSQLTerraform+2 more

Daikin Comfort
Data Engineer 1
United States
SQLAWS ‘big data’ technologies