Req I DNE_Hadoop
Primary Skills: DevOps, 200+ node Hadoop clusters, UNIX, MariaDB / MySQL, PostgreSQL, Vertica, YARN
• Design, implement, and support a high-performance, highly available infrastructure.
• Improve the efficiency and flexibility of our datacenters.
• Build and maintain models for growth and capacity planning.
• Tune large-scale data clusters for optimal performance and efficiency.
• 24/7/365 on-call rotation.
• Own the day-to-day health, uptime, monitoring, and reliability of all data platforms and database systems.
• Work closely with project management and engineering peers to develop innovative technical tools and solutions.
• Identify tactical issues and react to emerging areas of concern.
• Adhere to a DevOps philosophy by evangelizing communication, collaboration, and integration with software development teams.
• Think long-term and be unsatisfied with band-aids.
• Identify unnecessary complexity and remove it.
• Experience running Hadoop clusters of 200+ nodes.
• At least four years' experience in Data or Database Operations, Site Reliability Engineering, System Administration, or equivalent roles.
• Demonstrated experience in network and large-scale UNIX system troubleshooting and maintenance practices.
• Capability to script and automate solutions, with strong competence in at least one programming language.
• Solid knowledge of UNIX command-line tools.
• Firm grasp of storage protocols and filesystems.
• Deep experience installing and managing one or more of the following: Hadoop clusters and related services, RDBMS platforms (e.g. MariaDB / MySQL, PostgreSQL, Vertica), distributed data systems (e.g. Riak, Druid, Kafka).
• Implementation and management of monitoring and metrics tools (e.g. Nagios, Graphite, Grafana, SumoLogic).
• Excellent organizational skills and the ability to work in a fast-paced and hectic work environment.
• Capable of technical deep-dives into code, networking, systems, and storage alongside SRE and software engineering teams.
• Willing to occasionally travel to different office locations.
• Knowledge of and interest in the latest system architecture trends.
• Ability to learn and understand new systems.
• Ability to communicate effectively and write accurate, clear documentation.
• Humility and integrity.
Nice to have:
• Running and troubleshooting Erlang, Java or Python applications.
• Hardware configurations for data systems.
• Other operational data technologies like HBase, Spark, Redis and RabbitMQ.
• Analytical data platforms like Vertica and MicroStrategy.
• Hadoop-based computational technology like YARN and Impala.
• Configuration management systems like SaltStack and Chef.
• Container technology like Docker and Mesos.
• Experience with Cloudera.
• Agile development practices.