Site Reliability Engineer (SRE) Apache Flink & Kubernetes Job at Purple Drive, Jersey City, NJ

Mm1Hd092VUdQMDlyeEtGbTFLL1g5S1k5ckE9PQ==
  • Purple Drive
  • Jersey City, NJ

Job Description

**************LOCAL PREFERRED***********************

We are seeking a highly skilled Site Reliability Engineer (SRE) with strong expertise in Apache Flink, Kubernetes, and automation . The ideal candidate will be responsible for designing, deploying, and maintaining scalable, resilient systems, while ensuring high availability and performance in production environments. This role requires a solid background in distributed systems, container orchestration, and DevOps practices.

Key Responsibilities

  • Design, implement, and maintain scalable Apache Flink deployments on Kubernetes .
  • Develop automation tools and scripts to streamline deployment, monitoring, and maintenance of Flink jobs and infrastructure.
  • Ensure high availability, scalability, and reliability of production systems.
  • Collaborate with development and infrastructure teams to optimize application performance.
  • Build and manage monitoring/alerting systems using Prometheus, Grafana, ELK stack, or similar tools .
  • Work with cloud platforms (AWS, GCP, Azure) to design and manage infrastructure.
  • Apply best practices for networking, security, and container orchestration .
  • Troubleshoot complex production issues and drive root cause analysis.
  • Contribute to CI/CD pipelines for deployment automation.
  • Participate in on-call rotations to ensure uptime and reliability.
Required Skills & Qualifications

  • Strong hands-on experience with Apache Flink in production environments .
  • Expertise in Kubernetes (Helm, Operators, CRDs).
  • Proficiency in scripting languages ( Python, Bash, Go ).
  • Experience with monitoring & observability tools (Prometheus, Grafana, ELK, etc.).
  • Solid understanding of cloud platforms (AWS, GCP, Azure).
  • Strong knowledge of networking, security, and container orchestration .
  • Familiarity with CI/CD pipelines and DevOps practices .
  • Excellent problem-solving, debugging, and communication skills.

Job Tags

Local area,

Similar Jobs

CyberCoders

Commercial Paint Project Manager Job at CyberCoders

 ...Project Manager Position Overview We are seeking an experienced Project Manager to oversee and coordinate our commercial painting projects. The ideal candidate will have a strong background in coatings and painting, ensuring that projects are completed on time,... 

City and County of San Francisco

Police Officer - New Recruit (Entry Level) Job at City and County of San Francisco

 ...at application; California license by hire Background: No felony; no domestic-violence conviction; no misdemeanor prohibiting firearm ownership; not restricted from CCSF employment Hiring Process (Overview) Written Exam (Pass/Fail) choose one: FrontLine National... 

System One

Nuclear Digital I&C IV&V Engineer Job at System One

 ...Job Title: Nuclear Digital I&C IV&V Engineer Type: Contract Contractor Work Model: Remote(with potential travel requirements) As a Nuclear Digital I&C Independent Verification and Validation (IV&V) Engineer, you will play a crucial role in overseeing and maintaining... 

MCI Careers

Call Center Representative (Blended) Job at MCI Careers

 ...Entry-Level POSITION OVERVIEW: MCI is one of the fastest-growing tech-enabled business services companies in the USA, with a strong call center footprint and operations that extend across multiple countries. We deliver Customer Experience (CX), Business Process Outsourcing... 

US Air Mobility Command

CHILD AND YOUTH PROGRAM ASSISTANT (ENTRY LEVEL) Job at US Air Mobility Command

 ...Information). Pay will be set based on experience and education and/or certification:...  ...and serves snacks/meals. Executes work in accordance with policies and regulations...  ...religious; spiritual; community; student; social). You will receive credit for all qualifying...