About Affinity

Affinity is pioneering new frontiers in AdTech: developing solutions that push past today’s limits and open up new opportunities. We are a global AdTech company helping publishers discover better ways to monetize and enabling advertisers to reach the right audiences through new touchpoints. Operating across 10+ markets in Asia, the US, and Europe with a team of over 500 experts, we are building privacy-first ad infrastructure that opens up opportunities beyond the walled gardens.

Role: Sr. Software Engineer, Database

Work Location: Mumbai (Malad)

Product: Siteplug.com

About Role:
As a Senior Software Engineer, Database at SitePlug, you will play a key role in designing, developing, and optimizing large-scale databases and data pipelines that power our advertising technology platforms. You will be responsible for managing complex SQL queries, building robust ETL processes, and leveraging big data technologies such as Hadoop and Spark to process structured and unstructured data at scale. This is an individual contributor role that requires hands-on expertise in database development, performance tuning, and big data ecosystems, along with the ability to solve complex problems through efficient solution design. You will work closely with cross-functional engineering teams to integrate scalable data solutions into production systems, ensuring reliability, security, and high performance.

Roles & Responsibility:

Design, develop, and optimize database schemas, tables, indexes, and relationships to ensure efficient data storage and retrieval.

Write complex SQL queries, stored procedures, triggers, and functions to support business and application requirements.

Gather, clean, and process raw structured and unstructured data from multiple sources (APIs, relational DBs, distributed file systems).

Design and implement ETL pipelines for data ingestion, transformation, and storage using MySQL, Hadoop, and Spark.

Perform query optimization, indexing, and partitioning to improve database performance.

Manage replication, clustering, and failover strategies to ensure high availability.

Design and manage large-scale datasets using Hadoop ecosystem components (HDFS, MapReduce, Hive, Impala, Kafka, HBase, Pig).

Build and maintain real-time streaming pipelines using Apache Spark and Spark Streaming.

Collaborate with DevOps to support CI/CD pipelines for database-related deployments.

Take end-to-end responsibility for database lifecycle management (MySQL + Big Data ETL + Analytics).

Required Skills:

5+ years of SQL (MySQL) experience.

2+ years hands-on experience with Cloudera Hadoop Distribution and Apache Spark.

Proficiency in database development (queries, triggers, stored procedures) and knowledge of DB internals.

Experience with database administration, performance tuning, replication, backup, and restoration.

Comprehensive knowledge of Hadoop Architecture, HDFS, MapReduce, Hive, Impala, Kafka, HBase, Pig, and Java.

Experience in processing large structured & unstructured datasets.

Sr. Software Engineer, Database (SP)

Submit Your Application