Lead Java Developer with PySpark

Qode
Full-time
On-site
Job Title: Lead Java Developer with PySpark
Locations: Dallas, TX / Pittsburgh, PA / Cleveland, OH (Hybrid, onsite 2–3 days per week)
 
Job Summary:
We are seeking a highly experienced Lead Java Developer with expertise in Apache Spark (PySpark) to join a critical data engineering and backend development initiative. The ideal candidate will lead a team of engineers in designing and implementing scalable backend solutions that integrate with large-scale data pipelines.
Key Responsibilities:
  • Lead the design and development of scalable backend services using Java and Spring Boot
  • Develop batch and real-time data processing pipelines using PySpark on Apache Spark
  • Work closely with Data Engineers, Architects, and DevOps teams to ensure smooth data flow and platform stability
  • Optimize and tune performance of Java applications and PySpark jobs
  • Implement and enforce best practices in code quality, testing, and deployment
  • Participate in Agile ceremonies and work closely with business stakeholders to deliver solutions
  • Mentor junior developers and review code to ensure maintainability and performance
Required Skills & Experience:
  • 10+ years of hands-on experience in Java development (Java 8+)
  • Strong experience with Spring Boot, REST APIs, and microservices
  • Proficient in PySpark and distributed data processing using Apache Spark
  • Hands-on experience with the Hadoop ecosystem, including HDFS and Hive
  • Experience working with Kafka for data streaming
  • Strong knowledge of SQL, data modeling, and performance tuning
  • Exposure to CI/CD pipelines, Docker, and Kubernetes is a plus
  • Excellent problem-solving, communication, and leadership skills
Preferred Qualifications:
  • Experience in the banking / financial services domain
  • Familiarity with AWS / Azure cloud platforms
  • Exposure to tools such as Airflow, Databricks, or Snowflake
Education:
  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field