All jobs / Data Engineering

Posted 8d ago (Jun 1, 25)

Acubed logoAcubedSunnyvale, California

Data Engineer

United States$124.227k - $157.001kFull-timePythonSQLPostgresqlClickHouseDagsterApache SparkDaskDockerSlurm

We are looking for a Data Engineer who specializes in managing and optimizing data pipelines, with a specific focus on AI/ML data and applications. The ideal candidate will have experience in designing and implementing data infrastructure, ensuring high-quality labeled datasets, and collaborating with cross-functional teams to support our computer vision initiatives.

Responsibilities

  • Manage data releases, tracking data provenance, evaluating data quality, providing data insight reporting.
  • Contribute to scalable data and labeling pipelines for computer vision projects.
  • Manage and optimize data storage solutions to ensure efficient data retrieval and processing.
  • Monitor and troubleshoot data pipeline issues, ensuring timely resolution and minimal disruption to project timelines.
  • Collaborate with machine learning engineers and other data/infrastructure engineers to understand data requirements and ensure the availability of high-quality labeled datasets.
  • Stretch: Develop and maintain tools and processes for data labeling, including annotation workflows, quality control, and validation procedure.

Requirements

  • Bachelor’s in Computer Science, Data Engineering, and 2+ years of experience in a related field.
  • Strong Python and SQL; Good software development fundamentals a must.
  • Experience as a Data Engineer, working with data pipelines.
  • Proficiency with:
    • Data modeling
    • ETL processes and data pipeline operations
    • OLTP databases (e.g. Postgresql)
    • OLAP databases (e.g. ClickHouse)
  • Excellent problem-solving skills and attention to detail.
  • Strong communication skills and the ability to work effectively in a collaborative team environment.

Strong Plus

  • Experience with evaluation of ML models.
  • Experience developing evaluation frameworks.

Nice to Have

  • Data pipelines for computer vision applications.
  • Data labeling.
  • Workflow orchestration tools, particularly Dagster.
  • Distributed computing frameworks (Apache Spark, Dask).
  • Containerization technologies (e.g. Docker).
  • Job scheduling systems (e.g. Slurm).

Salary and compensation

$124,227 - $157,001 per year

Benefits

health insurancepaid time offholidays401(k)Flexible Spending AccountHealth Savings AccountAirbus Employee Share Ownership Planflight traininghybrid work model with 3 days in office31 days per year remote work (including outside U.S.)
Apply for this position
Any feedback or want to report a concern?Help us maintain the quality of jobs posted on Nata in Data!Contact us

Similar jobs

ICONIQ logo

ICONIQ

Associate Data and Analytics Engineer

United States$130k - $150k
Middleby Marshal, Inc. logo

Middleby Marshal, Inc.

Data Engineer Intern

United States
The Hanover Insurance Group logo

The Hanover Insurance Group

Data Engineer

United States
Pattern logo

Pattern

Data Engineer

United States
Charles Schwab logo

Charles Schwab

Associate Data Engineer

United States
Abbvie logo

Abbvie

Data Engineer

United States$82.5k - $157.5k
Concentrix Catalyst logo

Concentrix Catalyst

Data Engineer

United States$130.541k - $137.186k
Newmark logo

Newmark

Data Engineer

United States$125k - $160k
Samsung Semiconductor logo

Samsung Semiconductor

Intern Data Engineer

United States
Garmin logo

Garmin

Data Engineer 2

United States
Lattice logo

Lattice

Data Engineer

United States$123k - $154k
Cargill logo

Cargill

Data Engineer

United States
Daikin Comfort logo

Daikin Comfort

Data Engineer 1

United States
View all jobs