Senior/Principal Data Scientist, Veeva

NLP

Salary not provided
MongoDB
AWS
Docker
Kubernetes
GCP
Python
Azure
Spark
NLTK
SpaCy
PyTorch
Junior, Mid and Senior level
Remote in Spain, Netherlands, UK

More information about location

Veeva

Cloud-based software for life sciences

Open for applications

Veeva

Cloud-based software for life sciences

1001+ employees

B2BEnterpriseSaaSCloud ComputingScience

Open for applications

Salary not provided
MongoDB
AWS
Docker
Kubernetes
GCP
Python
Azure
Spark
NLTK
SpaCy
PyTorch
Junior, Mid and Senior level
Remote in Spain, Netherlands, UK

More information about location

1001+ employees

B2BEnterpriseSaaSCloud ComputingScience

Company mission

Building the Industry Cloud for Life Sciences

Role

Who you are

  • 4+ years of experience as a data scientist (or 2+ years with a Ph.D. degree)
  • Master's or Ph.D. in Computer Science, Artificial Intelligence, Computational Linguistics, or a related field
  • Strong theoretical knowledge of Natural Language Processing, Machine Learning, and Deep Learning techniques
  • Proven experience working with large language models and transformer architectures, such as GPT, BERT, or similar
  • Familiarity with large-scale data processing and analysis, preferably within the medical domain
  • Proficiency in Python and relevant NLP libraries (e.g., NLTK, SpaCy, Hugging Face Transformers)
  • Experience in at least one framework for BigData (e.g., Ray, Spark) and one framework for Deep Learning (e.g., PyTorch, JAX)
  • Experience working with cloud infrastructure (e.g., AWS, GCP, Azure) and containerization technologies (e.g., Docker, Kubernetes) and experience with bashing script
  • Strong collaboration and communication skills, with the ability to work effectively in a cross-functional team
  • Used to start-up environments
  • Social competence and a team player
  • High energy and ambitious
  • Agile mindset

Desirable

  • Background in Medical NLP
  • Experience with training, fine-tuning, and serving Large Language Models
  • Experience in life/health science industry, notably pharma
  • Having published in AI space in a peer-reviewed journal
  • Production-grade development Skills
  • Leadership skills and a solid network to help in hiring and growing the team
  • Experience with NoSQL databases, especially MongoDB
  • Familiarity with model registry solutions such as MLflow
  • Familiarity with distributed computing platforms such as Ray and Spark

What the job involves

  • Your role will primarily involve developing LLM-based agents that are specialized in searching and extracting detailed information about Key Opinion Leaders (KOLs) in the healthcare sector
  • You will craft an end-to-end human-in-the-loop pipeline to sift through a large array of unstructured medical documents—ranging from academic articles to clinical guidelines and meeting notes from therapeutic committees
  • These agents will be equipped to perform semantic searches and provide precise answers to predefined queries concerning KOL-related data across various languages and disciplines
  • Utilizing cloud infrastructure, you will build models capable of information extraction and question answering
  • You will also collaborate with a dedicated team of software developers and DevOps engineers to refine these models and deploy them into production environments
  • Adopt the latest technologies and trends in NLP to your platform
  • Develop LLM-based agents capable of performing function calls and utilizing tools such as browsers for enhanced data interaction and retrieval
  • Experience with Reinforcement Learning from Human Feedback (RLHF) methods such as Direct Preference Optimization (DPO) and Proximal Policy Optimization (PPO) for training LLMs based on human preferences
  • Design, develop, and implement an end-to-end pipeline for extracting predefined categories of information from large-scale, unstructured data across multi-domain and multilingual settings
  • Create a robust semantic search functionality that effectively answers user queries related to various aspects of the data
  • Use and develop named entity recognition, entity-linking, slot-filling, few-shot learning, active learning, question/answering, dense passage retrieval, and other statistical techniques and models for information extraction and machine reading
  • Deeply understand and analyze our data model per data source and geo-region and interpret model decisions
  • Collaborate with data quality teams to define annotation task metrics, and perform qualitative and quantitative evaluation
  • Utilize cloud infrastructure for model development, ensuring seamless collaboration with our team of software developers and DevOps engineers for efficient deployment to production

Our take

Veeva was created out of a predicted need for cloud-based enterprise software at a time when this software was nascent, and when current systems were archaic and cumbersome to use. By focusing on the life sciences sector, the company fills a niche to help some of the most operationally complex businesses run efficiently.

Veeva has managed to position itself as the go-to solution for large and growing companies in the life sciences sector by offering fully comprehensive cloud solutions that support the most critical functions from R&D to commercial.

The company has over 800 customers, ranging from the world’s largest pharmaceutical companies, such as AstraZeneca, to emerging biotechs. For future growth, Veeva looks to the continual development of its services and expansion of its customer base. Its 2024 launch of the Veeva Compass Suite for commercial data products gives a more comprehensive view of patient information and exemplifies how Veeva is strengthening its product to meet future challenges.

Freddie headshot

Freddie

Company Specialist at Welcome to the Jungle

Insights

Some candidates hear
back within 2 weeks

18% employee growth in 12 months

Company

Company benefits

  • Fitness and wellness reimbursement
  • 2% salary towards personal development
  • Childcare vouchers
  • Work anywhere
  • Home internet reimbursement
  • Private Medical Insurance 🇬🇧
  • Cash Health Plan 🇬🇧
  • Life Assurance 🇬🇧
  • Income protection 🇬🇧
  • Auto Enrolment Pension Scheme 🇬🇧
  • 25 days + Public Holidays + 3 day festive break 🇬🇧

Company values

  • Do the Right Thing
  • Customer Success
  • Employee Success
  • Speed

Company HQ

Pleasanton, CA

Leadership

Matt Wallach

(Board Member)

Previously Chief Marketing Officer at Health Market Science for 2 years and General Manager - Pharmaceuticals & Biotechnology at Siebel Systems for 5 years.

Previously Staff Developer at IBM for 5 years and SVP of Technology at salesforce.com for 2 years

Diversity, Equity & Inclusion at Veeva

Eric Seburyamo headshot

Eric Seburyamo (Chief Diversity Officer)

  • Veeva is committed to fostering a culture of inclusion and growing a diverse workforce. Diversity Makes Us Stronger It brings forth diverse perspectives and new ideas, and fuels innovation. Diversity also makes Veeva an exciting and fun place to work. Diversity comes in many forms. Gender, race, ethnicity, religion, politics, sexual orientation, age, and life experience shape us all into unique people. At Veeva, we respect the individual first and value our people for who they are and their unique contributions they bring to our teams. Veeva strives to foster a culture of inclusion where everyone feels comfortable being their true selves and can do their best work.
  • Diversity Communities: Our people shape Veeva culture. As part of our commitment to fostering a culture of inclusion, Veeva has launched the Veeva Diversity Communities. The employee-led communities are actively involved in Veeva’s efforts to grow a diverse workforce, raise awareness of social issues and celebrate our global team's diverse cultures and backgrounds.
  • Company-wide webinars focused on Diversity and Inclusion: We produce webinars and fireside chats highlighting inspiring people inside and outside of Veeva on topics related to diversity, inclusion, and leadership.
  • Internal training for managers focused on Diversity and Inclusion We developed an internal training focused on key concepts for executing our diversity and inclusion goals. The training focuses on: Why diversity, equity and inclusion is critical to workplace success for both you and your team Best practices for incorporating diversity and inclusion into our hiring and team development processes How biases, stereotypes and microaggressions can impact the employee experience Actionable steps you can take to practice inclusive leadership
  • Talent Attraction: Expanding our talent attraction efforts and training our talent attraction team to ensure we are engaging with a diverse talent pipeline to better our chances of hiring the best people

Share this job

View 191 more jobs at Veeva