Full Stack Data Scientist

Category
Information Technology
ID
2025-2707
Position Type
Regular Full -Time
Telecommute
Yes

Overview

Velocity Clinical Research is an owned and integrated research site organization, providing excellence in patient care, high quality data and fully integrated research sites. At Velocity, we align our values and behaviors to give our employees the best chance of delivering on our brand promise: to bring innovative medical treatments to patients. We are committed to making clinical trials succeed by generating high quality data from as many patients as possible, as quickly as possible while providing exemplary patient care at every step.

 

As an employee of Velocity, you are the most integral part of our mission. For talented candidates who perform at a high level, Velocity will invest to support career advancement and reward performance. Whether you are new to clinical research or are an industry veteran, we invite you to apply to Velocity.

 

As a key member of our AI team, you will own the end-to-end AI/ML pipelines and be instrumental in integrating Generative AI (GenAI) technologies. You will work with technical product managers, chief data architect, and business users to build and productionize AI and LLM-based models that support data-driven decision-making and smart automation. You will leverage tools such as GitHub Copilot, Cursor, WindSurf, and modern GenAI frameworks to deliver intelligent, scalable solutions.

Responsibilities

  • Engineer and manage end-to-end data and AI/GenAI pipelines, including data extraction, cleansing, transformation, training, and model deployment
  • Perform data analysis, LLM prompt design, and fine-tuning
  • Integrate and productionize LLM-based solutions and support RAG pipelines for scalable inference
  • Conduct and iterate on LLM evaluations (Evals) for model accuracy, relevance, and reliability
  • Develop advanced dashboards and visualizations for operational and strategic insights
  • Use dev tools such as GitHub Copilot, Cursor, WindSurf, and others to enhance development productivity
  • Design and implement model monitoring and feedback loop mechanisms to continuously improve system performance and reliability
  • Stay current with the latest in Generative AI, data science, and machine learning to evaluate and apply innovations to projects

Qualifications

Education:

  • B.Tech in Computer Science or a related field
  • Advanced diploma in Data Science, Machine Learning, or related area

Experience:

  • 3+ years of experience building and deploying AI/ML models
  • 3+ years of experience in building scalable data pipelines
  • 3+ years in developing advanced visualizations that deliver impactful insights
  • Experience working with LLM APIs (e.g., OpenAI, Hugging Face)

Preferred Skills:

  • Strong grasp of AI/ML and GenAI techniques including prompt engineering, RAG (Retrieval-Augmented Generation), and LLM evaluations (Evals)
  • Understanding of vector databases and embedding models
  • Familiarity with cloud-native development (AWS, GCP, Azure)
  • Strong understanding of databases, data lakes, and data warehouses
  • Proficiency with development tools such as GitHub Copilot, WindSurf, Cursor, Replit, and related platforms

NOTE: The above Job Description is intended to communicate the general function of the mentioned position and by no means should be considered an exhaustive or complete outline of the specific tasks and functions that will be required. Additionally, specific tasks and duties of the position are subject to change as the Company, the department and circumstances change. All employees are expected to perform their duties within their ability as required by the job and/or as requested by management

 

Options

Sorry the Share function is not working properly at this moment. Please refresh the page and try again later.
Share on your newsfeed