Junior Data Engineer Addepto

New

Skills

Airflow AWS Docker Java Kubernetes Python Scala Spark

Join Addepto, a renowned consulting and technology company recognized by Forbes as a top 10 AI company, specializing in AI and Big Data solutions for global enterprises and innovative startups. As a Junior Data Engineer, you will work on impactful, large-scale data projects across industries such as automotive, aerospace, and telecommunications, leveraging cutting-edge technologies and cloud platforms. This remote-friendly role offers a supportive, growth-oriented environment with opportunities to collaborate with leading brands and passionate experts.

Job Overview

As a Junior Data Engineer at Addepto, you will develop and maintain high-performance data processing platforms, design data pipelines, and ensure the scalability and reliability of data systems. You will work closely with cross-functional teams to integrate diverse data sources and optimize workflows, supporting mission-critical business decisions for top-tier clients.

Key Responsibilities
  • Develop and maintain scalable, high-performance data processing platforms for automotive and enterprise data.
  • Design and implement robust data pipelines for both streaming and batch data processing.
  • Optimize data workflows using tools like Spark, Cloudera, and Airflow.
  • Manage structured and unstructured data using data lake technologies such as Iceberg.
  • Collaborate with cross-functional teams to gather requirements and integrate multiple data sources.
  • Monitor platform performance, ensuring high availability and accuracy.
  • Leverage AWS cloud services for infrastructure management and workload scaling.
  • Write and maintain high-quality code in Python (or Java/Scala) for data processing and automation.
Required Skills & Qualifications
  • Minimum 1 year of hands-on experience with Big Data systems, data governance, or data management.
  • Proficient programming skills in Python, Java, or Scala; strong OOP and clean coding practices.
  • Practical experience with Spark, Cloudera, Airflow, NiFi, Docker, Kubernetes, Iceberg, Trino, or Hudi.
  • Solid understanding of dimensional data and data modeling techniques.
  • Experience deploying solutions in cloud environments (AWS, Azure, etc.).
  • Consulting experience with excellent communication and client management skills.
  • Ability to work independently, take ownership, and deliver high-quality results.
  • Fluent English (minimum C1 level).
  • Bachelor’s degree in a technical or mathematical field.
  • Nice to have: experience with MLOps frameworks (Kubeflow, MLFlow), familiarity with Databricks, dbt, or Kafka.

Job Type: Remote

Salary: Not Disclosed

Experience: Entry

Duration: 12 Months

Share this job:

Similar Jobs

Developer Advocate, DataHub

Posted 7 days ago

Empower and support DataHub developers

Create educational technical content

Airflow Apache Kafka Communication Community engagement

Senior Solutions Architect Role

Posted 9 days ago

Design and implement scalable data solutions

Lead and mentor engineering teams

Airflow Azure Databricks Hadoop

Senior Solutions Architect Role

Posted 10 days ago

Architect and deliver scalable data solutions

Lead and mentor engineering teams

Airflow AWS Databricks Hadoop

Senior Data Engineer Role

Posted 12 days ago

Architect scalable and reliable data infrastructure

Empower data-driven decision making and analytics

Airflow AWS Big Data Engineer

Lead Data Engineer Role

Posted 12 days ago

Architect scalable data platforms and pipelines

Drive innovation in cloud-native product development

Airflow Aws glue Docker Engineer

Data Governance Engineering Lead

Posted 13 days ago

Lead data governance strategy and execution

Build and manage governance-aware data pipelines

Airflow Architecture Computer science Databricks

AI Platform Lead Engineer

Posted 21 days ago

Design scalable AI data platforms

Optimize ML pipeline efficiency and resource allocation

Airflow BigQuery Dataflow Pyspark

Generative ML Engineer Role

Posted 21 days ago

Design and scale generative AI infrastructure

Develop and fine-tune generative video and visual models

Airflow Engineer Machine Learning Prompt Engineering

Data Engineer Crypto Platform

Posted 22 days ago

Design scalable data infrastructure

Build and maintain high-quality data pipelines

Airflow Big Data Engineer Kafka

Data Engineer, Solar Solutions

Posted 23 days ago

Enable data-informed decision-making organization-wide

Design and implement scalable cloud-based ETL/ELT solutions

Airflow Lambda NoSQL Pyspark

Remote Staff Data Engineer

Posted 24 days ago

Hire a remote data engineer

Build automated communication systems

Airflow AWS Data Data Warehousing

Data Governance Engineering Lead

Posted 31 days ago

Lead global data governance strategy and execution

Build and maintain governance-aware data pipelines

Airflow Architecture Databricks Infosec

Wikimedia Data Engineer Role

Posted 32 days ago

Design and maintain scalable data pipelines

Ensure data quality and reliability

Airflow CI/CD Hadoop hive

Senior ML Engineer, Ads

Posted 33 days ago

Lead end-to-end ML ad targeting product development

Drive technical research and strategic roadmap

Airflow BigQuery Deep Learning Machine Learning

Compliance Data Analyst Role

Posted 40 days ago

Develop and automate compliance dashboards and reports

Support regulatory reporting and audit readiness

Airflow AWS Data Analysis Data Analyst

BI Analyst – Sporty Group

Posted 42 days ago

Deliver actionable business insights

Develop and optimize data pipelines

A/b Testing Ab testing Airflow AWS

Senior Solutions Architect Role

Posted 42 days ago

Design and implement scalable data solutions

Lead and mentor engineering teams

Airflow AWS Azure Hadoop

Senior Data Engineer Role

Posted 54 days ago

Design and build scalable data architectures

Lead customer-facing technical engagements

Airflow Big Data Engineer Google Cloud Platform

Senior Data Engineer Role

Posted 54 days ago

Architect scalable and reliable data pipelines

Develop and launch self-serve analytics products

Airflow AWS Databricks Engineer

Senior Data Engineer Wikimedia

Posted 57 days ago

Design and maintain scalable data pipelines

Ensure data quality and governance

Airflow Docker Hadoop hive

Senior Data Engineer Wikimedia

Posted 58 days ago

Develop and maintain scalable data pipelines

Ensure data quality and robust monitoring

Airflow Docker Engineer Hadoop

Senior Data Engineer Wikimedia

Posted 58 days ago

Design and build scalable data pipelines

Ensure data quality and system reliability

Airflow Docker Engineer Hadoop

Senior Data Engineer Wikimedia

Posted 58 days ago

Develop robust, scalable data pipelines

Ensure high data quality and governance

Airflow Docker Engineer Hadoop

Senior Data Engineer Wikimedia

Posted 58 days ago

Build scalable and robust data pipelines

Enhance data quality and governance

Airflow Docker Engineer Hadoop

Senior Data Engineer Wikimedia

Posted 58 days ago

Develop scalable and robust data pipelines

Ensure high data quality and system reliability

Airflow Docker Engineer Hadoop

Senior Data Engineer Wikimedia

Posted 63 days ago

Design and build scalable data pipelines

Ensure data quality and robust monitoring

Airflow Docker Engineer Hadoop

Wikimedia Data Engineer Role

Posted 63 days ago

Develop scalable and robust data pipelines

Enhance data quality through monitoring and alerting

Airflow CI/CD Engineer hive

Lead Data Engineer Role

Posted 64 days ago

Lead and architect scalable data platforms and pipelines

Apply modern software engineering and cloud-native principles

Airflow Aws glue Devops Docker

Wikimedia Data Engineer Role

Posted 64 days ago

Build and maintain scalable data pipelines

Ensure data quality and governance across systems

Airflow CI/CD Engineer Hadoop

Staff ML Engineer, Apollo

Posted 69 days ago

Lead development of scalable ML systems

Advance Apollo's AI-native product features

Airflow Architecture Engineer Machine Learning

BI Analyst – Sports Betting

Posted 74 days ago

Mine and analyze large-scale business data

Develop and maintain dashboards and reports

A/b Testing Ab testing Airflow Analyst

Staff ML Engineer, Apollo

Posted 101 days ago

Lead and scale ML-driven product features

Develop and optimize AI-first user experiences

Airflow Databricks Engineer Machine Learning

Senior Data Science Manager

Posted 111 days ago

Lead and mentor a data science team

Integrate analytics into business strategy

Airflow Amplitude AWS Databricks

Cloud Data Solutions Architect

Posted 112 days ago

Lead and manage modern cloud data platforms for clients.

Provide architectural guidance and operational support.

Airflow AWS Azure CI/CD

Cloud Data Solutions Architect

Posted 115 days ago

Lead operation and management of cloud data platforms

Provide architectural guidance and technical leadership

Airflow AWS Azure CI/CD

Senior Data Engineer Wikimedia

Posted 117 days ago

Design and build scalable data pipelines

Ensure data quality and robust monitoring

Airflow Docker Engineer Hadoop

Senior Data Engineer Wikimedia

Posted 117 days ago

Design and build scalable data pipelines

Ensure and monitor data quality

Airflow Docker Engineer Hadoop

Senior Data Engineer Wikimedia

Posted 117 days ago

Develop and maintain scalable data pipelines

Ensure data quality, governance, and lineage

Airflow Docker Engineer Hadoop

Senior Data Engineer Wikimedia

Posted 117 days ago

Design and develop scalable data pipelines

Enhance data quality, governance, and lineage

Airflow Docker Engineer Hadoop

Senior Data Engineer Wikimedia

Posted 117 days ago

Design and maintain scalable data pipelines

Ensure data quality, governance, and lineage

Airflow Docker Engineer Hadoop

Senior Data Engineer Wikimedia

Posted 117 days ago

Design and scale data infrastructure

Ensure data quality and governance

Airflow Docker Engineer Hadoop

Junior Data Scientist Germany

Posted 118 days ago

Deliver actionable business insights

Collaborate across cross-functional teams

Airflow AWS Looker Pandas

Lead Data Engineer Role

Posted 154 days ago

Design and implement scalable data solutions

Lead and mentor engineering teams

Airflow AWS Databricks Java

Senior Data Engineer Africa

Posted 161 days ago

Ensure high-quality, reliable data management.

Automate data quality assurance processes.

Airflow AWS Lambda Postgresql

Senior Data Engineer

Posted 186 days ago

Architect scalable data models and ETL pipelines

Design and launch self-serve analytics products

Airflow AWS Big Data Databricks

Senior ML Engineer II at Apollo

Posted 188 days ago

Build and productionize Machine Learning models for Apollo products

Optimize users' experience at all stages of their product journey

Ai Systems Airflow Cloud Computer science

Staff Data Engineer

Posted 191 days ago

Design and build efficient data pipelines for large volumes of data

Optimize transformation models and data pipelines

Airflow Python Redshift Sql

Technical Delivery Manager

Posted 208 days ago

Lead and motivate client technical teams for modern data platforms

Maintain knowledge of modern data technology for best practices

Airflow Architecture AWS Azure

Data Sales Automation Platform

Posted 218 days ago

Enhancing media sales innovation through automation and intelligent proposals

Driving growth and optimization for media companies and agencies

Airflow Data Modeling Data Warehousing Etl Processes

ML Engineer on Apollo Team

Posted 218 days ago

Build and deploy ML models for Apollo products.

Enhance user experience through data-driven insights.

Airflow Cloud Computer science Databricks
overtime