Mindlance

Sr. Hadoop Data Engineer / Administrator


PayCompetitive
LocationJacksonville/Florida
Employment typeFull-Time
  • Job Description

      Req#: 25-73014

      100% Remote

      New update from HM:
      Breakdown of work: 30% Hadoop Admin work 70% Data Science/Analytics (SQL)
      HM is now requiring the following:
      •Must have analytics experience with data science and R
      •Experience with Starburst/TrinoPresto specialization highly preferred


      Core Business Hours: 8am-5pm (On-call rotation) 24x7 rotation (once every 4 weeks)
      Interview Comments: 2 rounds via MS Teams (audio & video required) (supplier participation required)


      Project Scope: Iceberg Implementation and other projects for EDS. This will be a heavy analytics-based project, involving SQL support specifically.

      •3-5 Years of DBA experience required
      •5-8 Years of Hadoop Admin experience required

      •Strong communication skills required
      •Must have strong experience working with SQL (our analytics platform)
      •Must have strong experience working with Spark (main platform)
      •Must have strong experience working with Linux OS

      •Exposure to Data Science tools highly preferred (but not required)

      Description:
      The Hadoop Administrator administers database systems to protect the confidentiality, integrity and availability of data. The Hadoop Administrator is responsible for installations, upgrades, backups, and configuration. They also maintain query language for established database systems including Starburst Trino , design and execute backup and recovery schemes and implement disaster recovery BDR for Hadoop procedures. The Administrator works with end-users and project team members to understand and advise on database Impala, Hive and Hbase and query requirements and data science query capabilities. They are assigned to complex database systems including those that work on Data science Linux servers and are relied upon to optimize database performance, R configuration, access and security and to serve on project teams contributing database subject matter expertise.

      Responsibilities:
      • Responsible for implementation and ongoing administration of Hadoop infrastructure, Data science infrastructure and Lakehouses
      • Aligning with the systems engineering team to propose and deploy new hardware and software environments required for Hadoop and to expand existing environments to Lakehouse technologies
      • Working with data delivery teams to setup new Hadoop users. This job includes setting up Linux users, setting up Kerberos principals and testing HDFS, Hive, Pig and MapReduce access for the new users.
      • Cluster maintenance as well as creation and removal of nodes using tools like Ganglia, Nagios, Cloudera Manager Enterprise, Dell Open Manage and other tools.
      • Performance tuning of Hadoop clusters and Hadoop MapReduce routines.
      • Screen Hadoop cluster job performances and capacity planning
      • Monitor Hadoop cluster connectivity and security
      • Manage and review Hadoop log files.
      • File system management and monitoring.
      • HDFS support and maintenance.
      • Diligently teaming with the infrastructure, network, database, application and business intelligence teams to guarantee high data quality and availability.
      • Collaborating with application teams to install operating system and Hadoop updates, patches, version upgrades when required.
      • Point of Contact for Vendor escalation

      DBA Responsibilities Performed by a Hadoop Administrator:
      • Data modelling, design & implementation based on recognized standards.
      • Software installation and configuration.
      • Database backup and recovery.
      • Database connectivity and security.
      • Performance monitoring and tuning.
      • Disk space management.
      • Software patches and upgrades.
      • Automate manual tasks.

      Required Experience:
      •3-5 Years of DBA experience required
      •5-8 Years of Hadoop Admin experience required

      •Strong communication skills required
      •Must have strong experience working with SQL (our analytics platform)
      •Must have strong experience working with Spark (main platform)
      •Must have strong experience working with Linux OS

      •Exposure to Data Science tools highly preferred (but not required)

      Required Education:
      •Related Bachelor's degree in an IT related field or relevant work experience

      EEO:
      “Mindlance is an Equal Opportunity Employer and does not discriminate in employment on the basis of – Minority/Gender/Disability/Religion/LGBTQI/Age/Veterans.”

  • About the company

      Mindlance is one of the largest diversity-owned staffing firms in the US . As a recruitment centric talent acquisition company, Mindlance provides Technology, Engineering, Digital / Creative / Marketing, Clinical Research, Scientific, Finance, Professional and Payroll Management staffing services to Global 1000 companies across the US, Canada and India.

Notice

Talentify is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or protected veteran status.

Talentify provides reasonable accommodations to qualified applicants with disabilities, including disabled veterans. Request assistance at accessibility@talentify.io or 407-000-0000.

Federal law requires every new hire to complete Form I-9 and present proof of identity and U.S. work eligibility.

An Automated Employment Decision Tool (AEDT) will score your job-related skills and responses. Bias-audit & data-use details: www.talentify.io/bias-audit-report. NYC applicants may request an alternative process or accommodation at aedt@talentify.io or 407-000-0000.