2yrs
malaysia
Strong experience in data engineering, pipeline orchestration, and distributed data processing. Proficiency in Python and SQL. Experience with cloud data platforms, object storage, warehouses, workflow orchestration, and message queues. Familiarity with unstructured text data, NLP workflows, and ML data preparation. Understanding of data modeling, system reliability, monitoring, and performance tuning.
Strong experience in data engineering, pipeline orchestration, and distributed data processing. Proficiency in Python and SQL. Experience with cloud data platforms, object storage, warehouses, workflow orchestration, and message queues. Familiarity with unstructured text data, NLP workflows, and ML data preparation. Understanding of data modeling, system reliability, monitoring, and performance tuning.
We are looking for a Data Engineer to own the data pipelines, storage architecture, and AI-enablement layer for a media monitoring platform. This role will focus on building reliable data foundations for large-scale ingestion, processing, enrichment, and serving of multilingual media content for NLP and machine learning use cases.
Core Responsibilities