Job Title: Lead Java Developer with PySpark
Locations: Dallas, TX / Pittsburgh, PA / Cleveland, OH (Hybrid, onsite 2–3 days per week)
Job Summary:
We are seeking a highly experienced Lead Java Developer with expertise in Apache Spark (PySpark) to join a critical data engineering and backend development initiative. The ideal candidate will lead a team of engineers in designing and implementing scalable backend solutions that integrate with large-scale data pipelines.
Key Responsibilities:
- Lead the design and development of scalable backend services using Java and Spring Boot
- Develop batch and real-time data processing pipelines using PySpark on Apache Spark
- Work closely with Data Engineers, Architects, and DevOps teams to ensure smooth data flow and platform stability
- Optimize and tune performance of Java applications and PySpark jobs
- Implement and enforce best practices in code quality, testing, and deployment
- Participate in Agile ceremonies and collaborate with business stakeholders to deliver solutions
- Mentor junior developers and review code to ensure maintainability and performance
Required Skills & Experience:
- 10+ years of hands-on experience in Java development (Java 8+)
- Strong experience with Spring Boot, REST APIs, and Microservices
- Proficiency in PySpark and distributed data processing using Apache Spark
- Hands-on experience with the Hadoop ecosystem, including HDFS and Hive
- Experience working with Kafka for data streaming
- Strong knowledge of SQL, data modeling, and performance tuning
- Exposure to CI/CD pipelines, Docker, and Kubernetes is a plus
- Excellent problem-solving, communication, and leadership skills
Preferred Qualifications:
- Experience in the Banking / Financial Services domain
- Familiarity with AWS / Azure cloud platforms
- Exposure to tools such as Airflow, Databricks, or Snowflake
Education:
- Bachelor's or Master's degree in Computer Science, Engineering, or a related field