Junior Data Engineer Addepto

New

Skills

Airflow AWS Docker Java Kubernetes Python Scala Spark

Join Addepto, a renowned consulting and technology company recognized by Forbes as a top 10 AI company, specializing in AI and Big Data solutions for global enterprises and innovative startups. As a Junior Data Engineer, you will work on impactful, large-scale data projects across industries such as automotive, aerospace, and telecommunications, leveraging cutting-edge technologies and cloud platforms. This remote-friendly role offers a supportive, growth-oriented environment with opportunities to collaborate with leading brands and passionate experts.

Job Overview

As a Junior Data Engineer at Addepto, you will develop and maintain high-performance data processing platforms, design data pipelines, and ensure the scalability and reliability of data systems. You will work closely with cross-functional teams to integrate diverse data sources and optimize workflows, supporting mission-critical business decisions for top-tier clients.

Key Responsibilities
  • Develop and maintain scalable, high-performance data processing platforms for automotive and enterprise data.
  • Design and implement robust data pipelines for both streaming and batch data processing.
  • Optimize data workflows using tools like Spark, Cloudera, and Airflow.
  • Manage structured and unstructured data using data lake technologies such as Iceberg.
  • Collaborate with cross-functional teams to gather requirements and integrate multiple data sources.
  • Monitor platform performance, ensuring high availability and accuracy.
  • Leverage AWS cloud services for infrastructure management and workload scaling.
  • Write and maintain high-quality code in Python (or Java/Scala) for data processing and automation.

Required Skills & Qualifications
  • Minimum 1 year of hands-on experience with Big Data systems, data governance, or data management.
  • Proficiency in Python, Java, or Scala, with a strong grasp of OOP and clean coding practices.
  • Practical experience with Spark, Cloudera, Airflow, NiFi, Docker, Kubernetes, Iceberg, Trino, or Hudi.
  • Solid understanding of dimensional data and data modeling techniques.
  • Experience deploying solutions in cloud environments (AWS, Azure, etc.).
  • Consulting experience with excellent communication and client management skills.
  • Ability to work independently, take ownership, and deliver high-quality results.
  • Fluent English (minimum C1 level).
  • Bachelor’s degree in a technical or mathematical field.
  • Nice to have: experience with MLOps frameworks (Kubeflow, MLflow), familiarity with Databricks, dbt, or Kafka.

Job Type: Remote

Salary: Not Disclosed

Experience: Entry

Duration: 12 Months

Similar Jobs

Data Engineer, Autonomous Vehicles

New

Build and maintain scalable data pipelines

Improve data models and schema design

Airflow AWS ETL Hadoop

Senior ML Engineer, Trust

New

Build ML models for business, product, and operations.

Collaborate to prevent incidents and develop detection strategies.

Airflow C++ Java Kafka

Data Engineer, Growth Platforms

Posted 3 days ago

Build and maintain reliable and scalable data pipelines for Growth use cases

Improve data models and schemas to meet evolving needs

Airflow AWS Data Warehousing ETL Processes

Senior Data Engineer

Posted 6 days ago

Architect scalable data pipelines for healthcare data

Lead technical decision-making for data infrastructure

Airflow AWS BigQuery GCP

Data Engineer

Posted 8 days ago

Design, build, and maintain ETL/ELT pipelines

Develop and manage BigQuery data warehouse

Airflow BigQuery Cloud Composer Dataflow

Senior Analytics Engineer

Posted 8 days ago

Model and document new datasets for business value

Automate and optimize business metrics

Airflow Postgres Snowflake Tableau

Senior Analytics Engineer

Posted 9 days ago

Model and document new datasets for business value

Automate and align business metrics with success criteria

Airflow Postgres Snowflake SQL

Senior Data Engineer

Posted 10 days ago

Designing scalable data pipelines

Contributing to architecture decisions

Airflow CI/CD Python SQL

Data Platform Engineer

Posted 10 days ago

Design and build data-heavy services

Develop data integration SDKs

Airflow BI Go Java

Senior Analytics Engineer

Posted 11 days ago

Model and document new datasets

Automate business metrics

Airflow Cloud Platforms Databricks PostgreSQL

Senior Analytics Engineer

Posted 12 days ago

Model and document new datasets

Automate business metrics

Airflow Cloud Platforms Postgres Snowflake

Senior ML Engineer, Data and AI

Posted 13 days ago

Handle large-scale data efficiently

Build and improve ML models

Airflow C++ Java Kafka

Senior ML Engineer

Posted 13 days ago

Design, build, deploy, and monitor production-ready ML services.

Collaborate with AI scientists to package and deploy ML models.

Airflow AWS Databricks Java

Senior Product Analyst II

Posted 13 days ago

Analyze data for product and marketing opportunities

Design and analyze tests for launches

A/B Testing Airflow Excel Google Sheets

Data Engineer

Posted 13 days ago

Build and scale ETL/ELT pipelines and compute infra across clouds.

Ingest and integrate new data sources for analytics and ML.

Airflow Azure Data Factory Azure Synapse BigQuery

Staff Software Engineer, Backend

Posted 13 days ago

Set technical strategy and collaborate with cross-functional teams

Define technical solutions and operational processes

Airflow AWS Code Review Distributed Systems

Ops Dev Engineer (Starlink)

Posted 14 days ago

Develop and scale network operations processes and tools

Analyze operations data for efficiency improvements

Airflow Grafana PostgreSQL Prometheus

Senior Data Engineer - Analytics

Posted 14 days ago

Plan and scale technology projects

Lead data architecture, security, and performance

Airflow Data Pipelines Data Warehousing Python

Engineering Manager Autonomy Validation

Posted 14 days ago

Develop vision and roadmap for compute frameworks and validation pipelines

Collaborate with AI teams for dataset automation

Airflow AWS Java Python

Senior Analytics Engineer

Posted 14 days ago

Modeling and documenting new datasets

Automating business metrics

Airflow Postgres Snowflake SQL

Senior Analytics Engineer

Posted 14 days ago

Model and document new datasets to drive business value

Automate business metrics for success alignment

Airflow Databricks Postgres Snowflake

Senior Software Engineer, Technical Search Visibility

Posted 15 days ago

Lead SEO and AEO projects

Design and ship full-stack features

Airflow Data Engineering Distributed Systems Full-stack Development

Senior Data Engineer, Reporting

Posted 15 days ago

Design scalable ETL pipelines

Build real-time data ingestion systems

Airflow ETL NumPy Pandas

Senior Analytics Engineer

Posted 15 days ago

Model and document new datasets

Automate business metrics alignment

Airflow Databricks Postgres Snowflake

Analytics Engineer

Posted 15 days ago

Design and build analytics data platform foundation

Develop scalable data models

Airflow Data Engineering Python Snowflake

Data Developer

Posted 15 days ago

Design, build, and maintain data pipelines

Develop, evaluate, and iterate ML models

Airflow BigQuery MLOps Python

Data Engineer Project

Posted 16 days ago

Design, build & maintain data infrastructure.

Enhance data monitoring & alerting.

Airflow AWS Azure Code Reviews

Senior Data Scientist, Identity

Posted 16 days ago

Analyze large datasets using SQL and scripting

Design and analyze A/B experiments to optimize risk actions

Airflow Looker Snowflake SQL

Senior Data Engineer

Posted 16 days ago

Lead end-to-end design and delivery of scalable data products.

Identify automation and integration opportunities.

AI Airflow Algorithms AWS

Senior Data Engineer

Posted 17 days ago

Architect high-scale data infrastructure

Build ELT/ETL workflows for data lakes and warehouses

Airflow AWS GCP Hadoop

Senior Data Engineer

Posted 17 days ago

Architect, design, and maintain scalable data pipelines

Implement cloud architecture with security and governance

Airflow AWS GCP Hadoop

Senior Backend Engineer

Posted 17 days ago

Lead and support a team of engineers

Collaborate with product, design, and analytics teams

Airflow AWS Data Lake Kotlin

Senior Backend Software Engineer

Posted 17 days ago

Lead and manage a team of engineers

Collaborate with product, design, and analytics departments

Airflow AWS Backend Development Distributed Systems

Senior Data Engineer

Posted 17 days ago

Architect high-scale data infrastructure

Design, build, and scale data pipelines

Airflow AWS GCP Hadoop

Senior Data Engineer

Posted 19 days ago

Design and maintain data pipelines from diverse sources

Develop data models for analytics and optimization

Airflow HBase Java Kafka

ML Engineer Risk Modeling

Posted 20 days ago

Deploy ML risk solutions at scale

Lead technical decisions and influence strategy

Airflow Deep Learning Machine Learning PySpark

Data Engineering Lead

Posted 20 days ago

Lead and contribute to data engineering infra and pipelines

Manage a team of senior data engineers

Airflow Automation Communication Data Engineering

Senior ML Software Engineer

Posted 20 days ago

Contribute to roadmap and architecture based on tech and business needs

Write well-crafted, well-tested, readable, maintainable code

Airflow AWS Azure Go

Real-Time Bidding Systems Lead

Posted 21 days ago

Lead design and development of real-time bidding systems

Build robust data pipelines for insights

Airflow BigQuery GCP Golang

Staff Data Engineer, Analytics

Posted 22 days ago

Lead design and implementation of shared data models and dimensions.

Standardize data engineering practices across teams.

Airflow Data Engineering Data Modeling Observability

Data Ecosystem Engineer II

Posted 22 days ago

Build self-service data experiences for analytics and reporting

Create fault-tolerant data pipelines across microservices and third-party systems

AI Airflow Django JavaScript

Analytics Engineer, Caper

Posted 23 days ago

Design and maintain production data models

Implement ELT in a cloud warehouse

Airflow BigQuery Git Looker

Data Platform Engineer

Posted 23 days ago

Build and maintain data platform capabilities

Contribute to data lake modernization

Airflow C# Databricks Go

Data Scientist - Growth

Posted 23 days ago

Drive growth through analytics and modeling

Integrate ML models for product enhancement

A/B Testing Airflow Causal Inference Python

Analytics Engineer

Posted 24 days ago

Design, build, and maintain scalable data models with SQL and dbt

Apply software engineering practices: versioning, testing, CI

Airflow BigQuery Data Modeling Data Warehousing

Healthcare Data Engineer

Posted 27 days ago

Develop secure, scalable data systems

Build and operate data pipelines

Airflow Azure EHR ETL

Senior Data Engineer

Posted 27 days ago

Design and build scalable data pipelines for clinical RWD

Implement Python/SQL ETL/ELT with AWS services

Agile Airflow CI/CD Git

Software Engineer Data Infrastructure

Posted 27 days ago

Design and implement data governance policies and procedures.

Build pipelines and warehouses for financial data reporting.

Airflow BigQuery Go Java

Senior Data Engineer, Infrastructure

Posted 27 days ago

Design and implement IAM policies, audit logging, and scalable access for data governance.

Build and ensure data integrity for financial data infrastructure.

Airflow BigQuery Python Spark

Senior DevOps Engineer II

Posted 28 days ago

Own critical infrastructure supporting engineering teams

Design, deploy, and operate Kubernetes platforms

Airflow AWS Bash GCP