Hadoop Administrator - Institutional Analytics & Informatics

Information Technology
Requisition #: 129227

Institutional Analytics & Informatics Vision:

Enable informatics and analytics across service lines by providing the right information to the right people at the right time with the right tools. The Institutional Analytics & Informatics team is focused on building the infrastructure and creating the capabilities necessary for sustainable clinical data delivery. With ongoing changes in the national health care environment, data will be increasingly required to optimize medical practice. Effective health care delivery must be of high quality and at a reasonable cost. Enterprise-wide vocabularies, data modeling, business intelligence solutions and natural language processing are key enabling technologies for clinical information management. We are the enterprise data architects and infrastructure experts.

The Hadoop Administrator provides technical expertise and guidance in the following areas:

1) Hadoop system design

2) Strategy and vision

3) Hadoop standards and best practices

4) Hadoop production support

5) Unix coding

6) Hortonworks administration

7) Big Data techniques

The Hadoop Administrator is responsible for comprehending Hadoop architecture, design and maintenance as well as providing solutions through development, thorough testing and documentation. This person must also have the ability to communicate effectively with their customers, peers and management and be able to handle multiple tasks at one time.

All of the Essential Functions listed below require the following physical and mental skills, as well as interfacing with CAI staff, other institutional staff, other State of Texas employees, vendors, contractors, visitors, and the general public: mathematical and business organization knowledge and analytical thinking; verbal and written communication, including diagrams and memos; hand typing on a typewriter and PC keyboard; hand drawing and writing, and visual reading, on paper, white board, and chalk board; traveling between several buildings, requiring exposure to outside environmental elements; and influencing integration best practices with both departmental and institutional staff.

Must have an understanding of information systems, with a specific focus on data integration principles, and be able to communicate that knowledge to co-workers effectively. Must be adaptable to change and able to interact and communicate with co-workers and customers in a positive, effective manner.



Hadoop Architecture:
Participates in reviewing business requirements, architecture design and proposed solutions in accordance with our Hadoop standards and requirements. Tasks include Hadoop architecture and design, security design, Kerberos, High Availability configuration and disaster recovery plans. Provides direction on the strategic vision of the Hadoop environment. Advises on proper environment installation, configuration, maintenance, security and monitoring of the Hadoop environment. Displays good working knowledge of the Hortonworks distribution of Hadoop and its architecture.

Hadoop Development and Design:
Develops and designs in accordance with departmental and institutional standards and guidelines for ingesting, transforming, governing and analyzing data. Development tasks include but are not limited to coding, troubleshooting, testing, documenting and enforcing Hadoop standards according to M. D. Anderson Information Services policies, procedures and best practices. Reviews other integration administrators' efforts to ensure consistent methodologies and standards, and makes recommendations where necessary. Efforts include job scheduling, custom code development, performance tuning, Hadoop component installation and configuration, design and development of schemas and models in the Hadoop components, and coordinating activities between internal teams, customers and other departments. Applies knowledge of Big Data integration and streaming components and practices, which includes writing or tuning Python, Scala, Java or Spark code.

Hadoop Production Support:
Actively participates in the installation, configuration, monitoring, maintenance and administration of the Hadoop environment. Requires an understanding of the Linux and Unix operating systems, including CPU and memory. Provides after-hours (on-call) support as needed for daily, weekly, and monthly processes. Responsible for system availability, performance, code migration, backups, disaster recovery and security. Requires a thorough understanding of Kerberos, Ranger and Knox security.

Other duties as assigned


Education Required:

Bachelor's degree. 

Preferred Education:

Required degree plus coursework towards an MS, MIS, or MBA

Experience Required:

Four years of integration programming and/or systems-level experience. Additional years of equivalent experience may be substituted for the required degree on a one-to-one basis.


Preferred Experience:

Five years of Hadoop administration and/or systems level experience.

License / Certification Required:


Preferred Certification:  

Hadoop Administrator certification

Onsite Presence: Required

It is the policy of The University of Texas MD Anderson Cancer Center to provide equal employment opportunity without regard to race, color, religion, age, national origin, sex, gender, sexual orientation, gender identity/expression, disability, protected veteran status, genetic information, or any other basis protected by institutional policy or by federal, state or local laws unless such distinction is required by law. http://www.mdanderson.org/about-us/legal-and-policy/legal-statements/eeo-affirmative-action.html
