US Tech Solutions

Data Analyst (Bioinformatics Data Curation Engineer)


PayCompetitive
LocationRemote
Employment typeContract

This job is now closed

  • Job Description

      Req#: 24-09578
      Position Title: Data Analyst (Bioinformatics Data Curation Engineer)
      Length of Contract: 1yr
      Location/Site: LC or remote (Eastern time zone preferred) Hybrid if near LC, Remote otherwise

      What are the top 3-5 skills, experience or education required for this position:
      1. PostgreSQL
      2. Bioinformatics datasets ( BulkRNAseq, CRISPR)
      3. Python/R
      4. Common data models
      5. Code management and documentation

      As a Data Analyst in Genomics Research Center’s Bioinformatics Engineering team, you will be responsible for developing and running workflows to standardize and load project datasets in a centralized environment. You will work closely with Bioinformatics Engineering, as well as therapeutic area facing bioinformatics research scientists to leverage common data models, and support GRC Bioinformaticians’ needs for loading and querying. Your expertise in PostgreSQL for database management and Python and R for scripting and automation will be crucial in developing and maintaining ETL processes to ensure data quality and integrity.

      Responsibilities
      • Develop and maintain a functional understanding of the GRC common data models, loading processes, and requirements, and perform accurate and efficient loading of new and historical datasets into the GRC’s Omics Data Server.
      • Collaborate with Bioinformatics Engineers to develop and implement additional data loading workflows.
      • Partner with Bioinformatics research scientists to identify, process, and load project data into the common data models.
      • Build and execute ETL processes to integrate non-GRC generated high-value datasets into the common data models.
      • Keep thorough documentation for tracking datasets and loading tasks.
      • Ensure reproducibility and facilitate collaboration with team members by documenting and versioning code with git.

      Qualifications
      • Bachelor's degree in computer science, bioinformatics, or a related field +3 years of experience.
      • Experience with building and running workflows for RDMS data loading and ETL processes.
      • Proficient in PostgreSQL (or equivalent) and ability to write complex queries for data extraction and analysis.
      • Strong programming skills in Python for scripting and automation. Additional experience with R is preferred.
      • Familiarity with genomic data formats and databases commonly used in bioinformatics research.
      • Knowledge of data modeling concepts and implementing common data models in a relational database.
      • Familiarity with data cleaning, normalization, and quality control processes.
      • Excellent communication skills and ability to collaborate with researchers and stakeholders.
  • About the company

      Our Talent Your Results - This is the premise behind US Tech Solutions. Our flexible engagement model offers right-fit talent on-demand - when, where and how you need it, so you can achieve your business objectives. We offer contingent, contract-to-hire or direct staffing services. At USTECH, we understand how critical talent is to every organization, as well as how the world of work and the workplace is changing. We offer the most effective means to help you acquire, manage and optimize talent. USTECH was founded in 2000 by Manoj Agarwal. Today, we are a global firm offering talent solutions to 150 customers including 20% of Fortune 500 across Financial Services, Healthcare, Life Sciences, Aerospace, Energy, Retail, Telecom, Technology, Manufacturing, and Engineering.