Job Description
Senior Java Spark Developer
Job Summary –
We are seeking a Senior Java Spark Developer with expertise in Java, Apache Spark, and the Cloudera Hadoop Ecosystem to design and develop large-scale data processing applications. The ideal candidate will have strong hands-on experience in Java-based Spark development, distributed computing, and performance optimization for handling big data workloads.
Responsibilities:
✅ Java & Spark Development:
Develop, test, and deploy Java-based Apache Spark applications for large-scale data processing.
Optimize and fine-tune Spark jobs for performance, scalability, and reliability.
Implement Java-based microservices and APIs for data integration.
✅ Big Data & Cloudera Ecosystem:
Work with Cloudera Hadoop components such as HDFS, Hive, Impala, HBase, Kafka, and Sqoop.
Design and implement high-performance data storage and retrieval solutions.
Troubleshoot and resolve performance bottlenecks in Spark and Cloudera platforms.
✅ Collaboration & Data Engineering:
Collaborate with data scientists, business analysts, and developers to understand data requirements.
Implement data integrity, accuracy, and security best practices across all data processing tasks.
Work with Kafka, Flume, Oozie, and Nifi for real-time and batch data ingestion.
✅ Software Development & Deployment:
Implement version control (Git) and CI/CD pipelines (Jenkins, GitLab) for Spark applications.
Deploy and maintain Spark applications in cloud or on-premises Cloudera environments.
Years of experience needed –
8+ years of experience in application development, with a strong background in Java and Big Data processing.
Technical Skills:
Strong hands-on experience in Java, Apache Spark, and Spark SQL for distributed data processing.
Proficiency in Cloudera Hadoop (CDH) components such as HDFS, Hive, Impala, HBase, Kafka, and Sqoop.
Experience building and optimizing ETL pipelines for large-scale data workloads.
Hands-on experience with SQL & NoSQL databases like HBase, Hive, and PostgreSQL.
Strong knowledge of data warehousing concepts, dimensional modeling, and data lakes.
Proven ability to troubleshoot and optimize Spark applications for high performance.
Familiarity with version control tools (Git, Bitbucket) and CI/CD pipelines (Jenkins, GitLab).
Exposure to real-time data streaming technologies like Kafka, Flume, Oozie, and Nifi.
Strong problem-solving skills, attention to detail, and ability to work in a fast-paced environment
Excellent Communication skills
Certifications Needed:
NA
About Mphasis
Mphasis applies next-generation technology to help enterprises transform businesses globally. Customer centricity is foundational to Mphasis and is reflected in the Mphasis’ Front2Back™ Transformation approach. Front2Back™ uses the exponential power of cloud and cognitive to provide hyper-personalized (C=X2C2TM=1) digital experience to clients and their end customers. Mphasis’ Service Transformation approach helps ‘shrink the core’ through the application of digital technologies across legacy environments within an enterprise, enabling businesses to stay ahead in a changing world. Mphasis’ core reference architectures and tools, speed and innovation with domain expertise and specialization are key to building strong relationships with marquee clients.
Equal Opportunity Employer:
Mphasis is an equal opportunity/affirmative action employer. We provide equal employment opportunities to applicants and existing associates and evaluate qualified candidates without regard to race, gender, national origin, ancestry, age, color, religious creed, marital status, genetic information, sexual orientation, gender identity, gender expression, sex (including pregnancy, breast feeding and related medical conditions), mental or physical disability, medical conditions military and veteran status or any other status or condition protected by applicable federal, state, or local laws, governmental regulations and executive orders. View the EEO in the law poster , view the EEO in the law supplement . To view the pay transparency nondiscrimination provision please click and to view the E-Verify posting click .
Mphasis is committed to providing reasonable accommodations to individuals with disabilities. If you need a reasonable accommodation because of disability to search and apply for a career opportunity, please send an email to accomodationrequest@mphasis.com and let us know your contact information and th e nature of your request.
Skills
PRIMARY COMPETENCY : Java Technologies PRIMARY SKILL : Java Spring MVC , WEB & Spring Boot PRIMARY SKILL PERCENTAGE : 70 SECONDARY COMPETENCY : Big Data Technologies SECONDARY SKILL : Apache Spark SECONDARY SKILL PERCENTAGE : 20 TERTIARY COMPETENCY : DevOps TERTIARY SKILL : CICD - Continuous Integration & Continuous Development TERTIARY SKILL PERCENTAGE : 10


