Junior Data Engineer Addepto

Skills

Airflow AWS Docker Java Kubernetes Python Scala Spark

Join Addepto, a renowned consulting and technology company recognized by Forbes as a top 10 AI company, specializing in AI and Big Data solutions for global enterprises and innovative startups. As a Junior Data Engineer, you will work on impactful, large-scale data projects across industries such as automotive, aerospace, and telecommunications, leveraging cutting-edge technologies and cloud platforms. This remote-friendly role offers a supportive, growth-oriented environment with opportunities to collaborate with leading brands and passionate experts.

Job Overview

As a Junior Data Engineer at Addepto, you will develop and maintain high-performance data processing platforms, design data pipelines, and ensure the scalability and reliability of data systems. You will work closely with cross-functional teams to integrate diverse data sources and optimize workflows, supporting mission-critical business decisions for top-tier clients.

Key Responsibilities
  • Develop and maintain scalable, high-performance data processing platforms for automotive and enterprise data.
  • Design and implement robust data pipelines for both streaming and batch data processing.
  • Optimize data workflows using tools like Spark, Cloudera, and Airflow.
  • Manage structured and unstructured data using data lake technologies such as Iceberg.
  • Collaborate with cross-functional teams to gather requirements and integrate multiple data sources.
  • Monitor platform performance, ensuring high availability and accuracy.
  • Leverage AWS cloud services for infrastructure management and workload scaling.
  • Write and maintain high-quality code in Python (or Java/Scala) for data processing and automation.

Required Skills & Qualifications
  • Minimum 1 year of hands-on experience with Big Data systems, data governance, or data management.
  • Proficiency in Python, Java, or Scala, with strong OOP skills and clean coding practices.
  • Practical experience with Spark, Cloudera, Airflow, NiFi, Docker, Kubernetes, Iceberg, Trino, or Hudi.
  • Solid understanding of dimensional data and data modeling techniques.
  • Experience deploying solutions in cloud environments (AWS, Azure, etc.).
  • Consulting experience with excellent communication and client management skills.
  • Ability to work independently, take ownership, and deliver high-quality results.
  • Fluent English (minimum C1 level).
  • Bachelor’s degree in a technical or mathematical field.
  • Nice to have: experience with MLOps frameworks (Kubeflow, MLflow) and familiarity with Databricks, dbt, or Kafka.

Job Type: Remote

Salary: Not Disclosed

Experience: Entry

Duration: 12 Months

Similar Jobs

Senior Data Engineer

Posted 23 days ago

Architect scalable data models and ETL pipelines

Design and launch self-serve analytics products

Airflow AWS Big Data Databricks

Senior ML Engineer II at Apollo

Posted 25 days ago

Build and productionize Machine Learning models for Apollo products

Optimize users' experience at all stages of their product journey

AI Systems Airflow Cloud Computer Science

Staff Data Engineer

Posted 28 days ago

Design and build efficient data pipelines for large volumes of data

Optimize transformation models and data pipelines

Airflow Python Redshift SQL

Technical Delivery Manager

Posted 45 days ago

Lead and motivate client technical teams for modern data platforms

Maintain knowledge of modern data technology for best practices

Airflow Architecture AWS Azure

Data Sales Automation Platform

Posted 55 days ago

Enhancing media sales innovation through automation and intelligent proposals

Driving growth and optimization for media companies and agencies

Airflow Data Modeling Data Warehousing ETL Processes

ML Engineer on Apollo Team

Posted 55 days ago

Build and deploy ML models for Apollo products.

Enhance user experience through data-driven insights.

Airflow Cloud Computer Science Databricks

Staff Analytics Engineer

Posted 61 days ago

Enable informed decision-making through accessible data

Lead data vision and architecture for impactful insights

Airflow Data Engineering Data Modeling ETL

Senior Data Engineer Project

Posted 63 days ago

Develop data infrastructure and systems for various business functions

Implement data observability and monitoring

Airflow Big Data Data Security DevOps

Data Lead: ETL & Analytics

Posted 70 days ago

Improve data infrastructure Optimize performance and accessibility Enable data-driven

g Collaborate with cross-functional teams Mentor and lead data

Airflow Analytics AWS BigQuery

Data Engineering Lead

Posted 70 days ago

- Manage platform APIs and AI capabilities - Oversee data system scalability and performance -

Collaborate with data science and product teams - Implement AI and ML models into the platform - Ensure

AI/ML Airflow AWS BigQuery

OpenSC Remote Jobs

Posted 71 days ago

- Enhance data solutions for sustainable food systems - Lead customer onboarding and supply chain

implementations - Transform sustainability goals into actionable solutions - Drive product

Airflow AWS Cloud Docker

AI Data Engineer

Posted 80 days ago

- Build production-grade data pipelines - Collaborate with cross-functional teams - Take on new

challenges and responsibilities - Shape company culture - Solve real-world complex

AI Frameworks Airflow BigQuery Python

Remote UK Skilled Worker Visa Jobs

Posted 80 days ago

Build infrastructure software for data platforms, mentor engineers, provide HR support, collaborate

internationally in forensic accounting, hire exceptional talent, drive future

Airflow Apache Docker Kubernetes

Remote AWS Developer Jobs

Posted 80 days ago

Reduce emissions through technology innovation

Collaborate with global enterprises

Airflow AWS Cloud DevSecOps

Data Engineering Manager

Posted 93 days ago

Own foundational data artifacts for the business domain Mentor, coach, and advocate for team

Design and build scalable data pipelines Contribute to data architecture and governance Ensure

Airflow Data Modeling Data Warehousing Kafka

Data Engineer

Posted 93 days ago

- Develop and maintain ETL pipelines - Implement data modeling techniques - Optimize data

ing and storage - Collaborate with cross-functional teams - Explore new technologies and

Airflow AWS Big Data Data Modeling

Health Insurance Data Analyst

Posted 101 days ago

Conduct SQL analysis for actionable insights, Maintain and optimize ML models, Analyze unstructured

logs, Develop ETL pipelines, Collaborate with engineering

Airflow Data Analysis Data Visualization ETL

Data Engineer

Posted 105 days ago

Design and develop infrastructure and tools for data systems; Generalize data points for multi-dimensional data stores; Build analytics lakehouse; Translate stakeholder requirements to solutions; Champion agile software development practices

Airflow AWS Postgres Python

AI Engineer - ML Ops

Posted 109 days ago

* Drive optimization in supply chain and manufacturing sector * Collaborate with cross-functional

teams to build high-quality product features * Deploy AI models to solve complex global problems *

Airflow BigQuery Python PyTorch

BI Analyst at Sporty Group

Posted 114 days ago

- Provide key insights for core business decisions - Enhance reporting processes with data

tion - Optimize existing reporting methods - Drive growth of core products - Maintain database

A/B Testing Airflow AWS ETL

Analytics Engineer for Real Estate

Posted 115 days ago

- Design and implement Data Pipelines with platform services and serverless solutions - Develop and

test ingestion pipelines from various sources - Create data transformations with SQL, Python, PaaS,

Airflow Apache ETL Python

Remote Senior Analytics Engineer Jobs

Posted 115 days ago

- Develop data pipelines and transform data - Optimize data infrastructure for decision-making -

Analyze product data and improve solutions - Enhance mental healthcare through data insights - Drive

Airflow AWS BigQuery ETL

Remote Analytics Engineer Jobs

Posted 115 days ago

Enhance data pipelines and models, Drive data-driven decision-making, Collaborate with

teams, Optimize data infrastructure, Analyze product

Airflow AWS Cloud ETL