Genentech
2024 Summer Intern - Large Language Model (LLM) & Knowledge Graph
This job is now closed
Job Description
- Req#: 202312-129189
Drive the application of best practices for algorithm development
Train large-scale experiments using cloud and HPC infrastructure
Deliver high-quality code and actively participate in code reviews
Keep accurate and timely records of research findings and progress
Intensive 12-week, full-time (40 hours per week) paid internship
Program start dates are in May/June (Summer)
A stipend, based on location, will be provided to help alleviate costs associated with the internship
Ownership of challenging and impactful business-critical projects
Work with some of the most talented people in the biotechnology industry
Must be pursuing a Master's Degree (enrolled student)
Must have attained a Master's Degree
Must be pursuing a PhD (enrolled student)
Must have attained a PhD
Computer Science, Data Science, Informatics, Bioinformatics, Health Informatics or related fields
Computer programming skills (highly proficient)
State-of-the-art Deep Learning and Machine Learning algorithms
Experience in Knowledge representation and reasoning, knowledge graphs, and graph neural networks
Quick learner and biology knowledge
Excellent communication, collaboration, and interpersonal skills.
Complements our culture and the standards that guide our daily behavior & decisions: Integrity, Courage, and Passion
2024 Summer Intern - Large Language Model (LLM) & Knowledge Graph
Department Summary
The AI, Cloud, and Engineering (ACE) Team at Genentech is seeking a highly motivated intern to work on Large Language Model (LLM)/Artificial Intelligence (AI).
Recent advancements in machine learning have led to impressive results in computer vision and natural language processing, where the combination of large amounts of data and computational resources has enabled the development of highly convincing generative models. However, in the field of healthcare, where data is often scarce, heterogeneous, and multimodal, these models have been less successful in capturing the underlying data generating mechanisms, or the causal relations.
The project is to build a LLM-based prediction model using a large amount of unlabeled data and can be adapted to a broad range of downstream tasks. LLMs can offer a promising approach for inference, particularly in cases where structured data and sample size are limited. This approach could have potentially significant implications for advancing biomedical research where obtaining abundant training clinical data is not readily possible. By leveraging the vast amounts of unstructured data available in the field, LLMs can help researchers bypassing the challenge of limited training data when building data-driven computational models.
This internship position is located in South San Francisco, CA on-site .
Key Responsibilities
Program Highlights
Who You Are (Required)
Required education:
You meet one of the following criteria:
Required majors:
Required skills:
Preferred Qualifications
Relocation benefits are not available for this job posting.
The expected salary range for this position based on the primary location for this position in California is $50.00 per hour . Actual pay will be determined based on experience, qualifications, geographic location, and other job-related factors permitted by law. This position also qualifies for paid holiday time off benefits.
#GNE-gCS-2024-Interns
Genentech is an equal opportunity employer, and we embrace the increasingly diverse world around us. Genentech prohibits unlawful discrimination based on race, color, religion, gender, sexual orientation, gender identity or expression, national origin or ancestry, age, disability, marital status and veteran status.
About the company
Genentech, Inc., is an American biotechnology corporation which became a subsidiary of Roche in 2009.