About Affinity
Affinity is pioneering new frontiers in AdTech: developing solutions that push past today’s limits and open up new opportunities. We are a global AdTech company helping publishers discover better ways to monetize and enabling advertisers to reach the right audiences through new touchpoints. Operating across 10+ markets in Asia, the US, and Europe with a team of over 500 experts, we are building privacy-first ad infrastructure that opens up opportunities beyond the walled gardens.
Role: Sr. Software Engineer, Database
Work Location: Mumbai (Malad)
Product: Siteplug.com
Roles & Responsibility:
Design, develop, and optimize database schemas, tables, indexes, and relationships to ensure efficient data storage and retrieval.
Write complex SQL queries, stored procedures, triggers, and functions to support business and application requirements.
Gather, clean, and process raw structured and unstructured data from multiple sources (APIs, relational DBs, distributed file systems).
Design and implement ETL pipelines for data ingestion, transformation, and storage using MySQL, Hadoop, and Spark.
Perform query optimization, indexing, and partitioning to improve database performance.
Manage replication, clustering, and failover strategies to ensure high availability.
Design and manage large-scale datasets using Hadoop ecosystem components (HDFS, MapReduce, Hive, Impala, Kafka, HBase, Pig).
Build and maintain real-time streaming pipelines using Apache Spark and Spark Streaming.
Collaborate with DevOps to support CI/CD pipelines for database-related deployments.
Take end-to-end responsibility for database lifecycle management (MySQL + Big Data ETL + Analytics).
Required Skills:
5+ years of SQL (MySQL) experience.
2+ years hands-on experience with Cloudera Hadoop Distribution and Apache Spark.
Proficiency in database development (queries, triggers, stored procedures) and knowledge of DB internals.
Experience with database administration, performance tuning, replication, backup, and restoration.
Comprehensive knowledge of Hadoop Architecture, HDFS, MapReduce, Hive, Impala, Kafka, HBase, Pig, and Java.
Experience in processing large structured & unstructured datasets.