Who can apply for the Cloud Data Engineer role at humaineeti Kolkata?

Engineers with 0–3 years of experience and a Bachelor's or Master's in CS, Data Engineering, Information Systems, or related — or equivalent demonstrable engineering experience. Strong SQL, Spark basics, and hands-on AWS Athena / S3 exposure are required.

What cloud platforms will I work with?

Primarily AWS — Athena, S3, and adjacent services for lakehouse and warehouse workloads. Exposure to GCP and Azure is welcome. Open-table formats like Iceberg and Delta Lake are part of the stack.

Is this role onsite, hybrid, or remote in Kolkata?

Onsite in Kolkata, West Bengal. We work in-person so early-career engineers get direct mentorship from senior engineers on architecture, runtime cost, and data quality.

Are freshers eligible for the Cloud Data Engineer role?

Yes. Freshers with strong portfolio projects in SQL, Spark, or cloud data engineering are encouraged to apply. Coursework plus a real pipeline you have built and can talk through is enough to get in the door.

What does the interview process look like?

A short screen, a SQL and data-modeling round, a Spark / pipeline design discussion using your own work, and a culture conversation with senior engineers. The loop typically wraps in 1–2 weeks.

How do I apply for the Cloud Data Engineer opening?

Use the application form on this page. Submit your name, email, phone, LinkedIn, GitHub, a resume link, and a short note on a data pipeline you have built or contributed to. Shortlisted candidates hear back within a week.

Cloud Data Engineer Jobs in Kolkata · 0–3 Years

About the Role

Data is the substrate every AI agent we ship runs on. As a Cloud Data Engineer, you will build the pipelines, models, and storage that make customer context fast, correct, and trustworthy — for both BI and AI workloads.

You will write SQL that holds up under scrutiny, ship Spark jobs that finish, and design data layouts that other engineers and agents can reason about. This role is for someone who cares about correctness, schema sanity, and runtime cost — not just "moving data from A to B."

What You'll Do

Model and ship SQL across lakehouses and warehouses for analytics, BI, and AI.
Build ETL / ELT pipelines on Spark — batch and incremental, with sensible partitioning and schema evolution.
Stand up AWS Athena queries and external tables over S3 data lakes.
Curate DWH layers — raw, staged, conformed, marts — with contracts and tests.
Wire pipelines into RAG, text-to-SQL, and agent workflows.
Own data quality, observability, and runbooks for the pipelines you ship.

Required Skills

SQL — joins, window functions, CTEs, query plans.
DWH — dimensional modeling, SCDs, warehouse vs lakehouse trade-offs.
Spark — PySpark or Spark SQL; debug skew, ship a transformation.
AWS Athena & S3 — partitioned tables, Parquet, lake hygiene.
Python — working level for pipelines and utilities.
Engineering habits — testing, review, docs; curiosity about cost.

Nice to Have

Iceberg or Delta Lake table formats.
Airflow / dbt / similar orchestration.
Streaming pipelines — Kinesis, Kafka.
Open-source or competition data work.
Embeddings, vector stores, or RAG pipelines.

Qualifications

Bachelor's / Master's in CS, Data Engineering, Information Systems, or related — or equivalent engineering experience.
0–3 years of professional or substantial project experience in Data Engineering.
Comfortable in a startup environment — high ownership, ambiguity, fast iteration.

What We Offer

Real data and AI problems with paying customers.
Mentorship from senior engineers (ex-AWS, ex-Google, ex-IBM).
A culture where data quality and runtime cost are first-class.
Competitive compensation, learning budget, rapid growth path.
Direct exposure to architecture decisions from day one.

Job Facts

LocationKolkata, WB

ModeOnsite

Experience0–3 years

TypeFull-time

Reports toSr. Data Engineer

Apply for this position

Onsite · Kolkata · ~2 min

Why humaineeti · Kolkata

AI engineering company headquartered in Kolkata, West Bengal. Senior engineers from AWS, Google, and IBM. Customers across BFSI, Manufacturing, Media, and Retail.

Not back-office analytics. Cloud data engineering for AI — lakehouses, agentic pipelines, RAG, text-to-SQL, on real customer workloads in West Bengal.

Common Questions

Who can apply?

0–3 yrs with a BS / MS in CS, Data Engineering, or Information Systems — or equivalent project work. Strong SQL, Spark basics, and AWS Athena / S3 hands-on required.

What cloud platforms?

Primarily AWS — Athena, S3, and adjacent services. GCP and Azure exposure welcome. Iceberg and Delta Lake are part of the stack.

Onsite, hybrid, or remote?

Onsite in Kolkata, West Bengal. In-person mentorship on architecture, cost, and data quality.

Are freshers eligible?

Yes — strong portfolio projects in SQL, Spark, or cloud data engineering are sufficient.

What's the interview loop?

Screen → SQL / data-modeling round → Spark / pipeline design on your work → culture conversation. ~1–2 weeks.

How do I apply?

Use the form below. Include a resume link, GitHub, and a short note on a data pipeline you have built or contributed to.

Other Openings

/ APPLY · Cloud Data Engineer

Send your application

Tell us about you, share your GitHub or portfolio, and add a short note on a data pipeline you have built or contributed to — what you owned, what you would do differently.