Returning Candidate?

Full Stack Data Scientist

Category: Information Technology
ID: 2025-2707
Position Type: Regular Full -Time
Telecommute: Yes

Overview

Velocity Clinical Research is an owned and integrated research site organization, providing excellence in patient care, high quality data and fully integrated research sites. At Velocity, we align our values and behaviors to give our employees the best chance of delivering on our brand promise: to bring innovative medical treatments to patients. We are committed to making clinical trials succeed by generating high quality data from as many patients as possible, as quickly as possible while providing exemplary patient care at every step.

As an employee of Velocity, you are the most integral part of our mission. For talented candidates who perform at a high level, Velocity will invest to support career advancement and reward performance. Whether you are new to clinical research or are an industry veteran, we invite you to apply to Velocity.

As a key member of our AI team, you will own the end-to-end AI/ML pipelines and be instrumental in integrating Generative AI (GenAI) technologies. You will work with technical product managers, chief data architect, and business users to build and productionize AI and LLM-based models that support data-driven decision-making and smart automation. You will leverage tools such as GitHub Copilot, Cursor, WindSurf, and modern GenAI frameworks to deliver intelligent, scalable solutions.

Responsibilities

Engineer and manage end-to-end data and AI/GenAI pipelines, including data extraction, cleansing, transformation, training, and model deployment
Perform data analysis, LLM prompt design, and fine-tuning
Integrate and productionize LLM-based solutions and support RAG pipelines for scalable inference
Conduct and iterate on LLM evaluations (Evals) for model accuracy, relevance, and reliability
Develop advanced dashboards and visualizations for operational and strategic insights
Use dev tools such as GitHub Copilot, Cursor, WindSurf, and others to enhance development productivity
Design and implement model monitoring and feedback loop mechanisms to continuously improve system performance and reliability
Stay current with the latest in Generative AI, data science, and machine learning to evaluate and apply innovations to projects

Qualifications

Education:

B.Tech in Computer Science or a related field
Advanced diploma in Data Science, Machine Learning, or related area

Experience:

3+ years of experience building and deploying AI/ML models
3+ years of experience in building scalable data pipelines
3+ years in developing advanced visualizations that deliver impactful insights
Experience working with LLM APIs (e.g., OpenAI, Hugging Face)

Preferred Skills:

Strong grasp of AI/ML and GenAI techniques including prompt engineering, RAG (Retrieval-Augmented Generation), and LLM evaluations (Evals)
Understanding of vector databases and embedding models
Familiarity with cloud-native development (AWS, GCP, Azure)
Strong understanding of databases, data lakes, and data warehouses
Proficiency with development tools such as GitHub Copilot, WindSurf, Cursor, Replit, and related platforms

NOTE: The above Job Description is intended to communicate the general function of the mentioned position and by no means should be considered an exhaustive or complete outline of the specific tasks and functions that will be required. Additionally, specific tasks and duties of the position are subject to change as the Company, the department and circumstances change. All employees are expected to perform their duties within their ability as required by the job and/or as requested by management

Options

Apply for this job onlineApply

Email this job to a friendRefer

Sorry the Share function is not working properly at this moment. Please refresh the page and try again later.

Share on your newsfeed

Application FAQs