For our client, a prominent American multinational firm specializing in IT services and consulting, headquartered in New Jersey and listed on NASDAQ, we are looking for Hadoop Site Reliability Engineers who embrace solving complex challenges on a global scale. As a Site Reliability Engineer, you will be an integral part of a cross-functional team inventing, designing, building, and testing software products that reach a truly global customer base. While supporting components of cutting-edge payment technology, you will get to see your efforts shaping the digital future of monetary transactions.
The Offer:
- Contract-based position for 8-12 months.
- Hybrid work model with 2-3 days in the office located in Warsaw.
- B2B contract of up to 210 PLN/h + VAT.
As a Hadoop SRE, you will:
- Carry out SRE duties for Big Data on various open-source platforms such as Hadoop, Spark, and HBase.
- Monitor the platforms and follow runbooks/SOPs to resolve platform and application problems.
- Familiarize yourself with the cluster maintenance processes and implement changes as per the documented installation and validation plans.
- Apply strong troubleshooting and debugging skills to pinpoint and resolve issues, and advise on how to prevent them from recurring.
- Conduct thorough root cause analysis of major production incidents, document the findings for future reference, and put in place proactive measures to enhance system reliability.
- Automate routine tasks using scripts or automation tools to reduce manual work, lower the risk of human error, and improve system reliability.
What do you need?
- At least 2-3 years of experience as a Hadoop Site Reliability Engineer for a junior-level role, or 5+ years for a mid-level/senior role.
- High-level knowledge of Hadoop platforms and core Hadoop components.
- Experience troubleshooting both Hadoop platform service and application problems and identifying their root cause.
- Experience writing Ansible playbooks and automating manual tasks with Ansible, shell scripts, and Python.
- Familiarity with Unix/Linux system internals, networking, and distributed systems.