CS-25-334 Large Language Models (LLM) for data extraction from clinical notes

About the project

This project aims to improve the extraction of structured data from free-text clinical notes in Electronic Medical Records (EMRs) using Large Language Models (LLMs). It focuses on generating synthetic clinical notes to facilitate data extraction and sharing while maintaining patient privacy. By using curated templates and LLMs, the project creates synthetic notes that mimic real ones without exposing Protected Health Information (PHI). It also involves fine-tuning LLMs to enhance data extraction accuracy and plans to validate synthetic notes through a Turing Test-style experiment. Future developments include expanding the tool to support various clinical note types and disease sites, and creating a web-based tool for customization.

Folder	Description
Documentation	all documentation the project team has created to describe the architecture, design, installation, and configuration of the project
Notes and Research	Relevant helpful information to understand the tools and techniques used in the project
Project Deliverables	Folder that contains final pdf versions of all Fall and Spring Major Deliverables
Status Reports	Project management documentation - weekly reports, milestones, etc.
scr	Source code - create as many subdirectories as needed

Project Team

Rishabh Kapoor - iHealth Solutions - Sponsor
Preetam Ghosh - Department of Computer Science - Faculty Advisor
Shashank Sinha - Computer Science - Student Team Member
Connor Holden - Computer Science - Student Team Member
August Moses - Computer Science - Student Team Member
Sawiya Aidarus - Computer Science - Student Team Member

Name		Name	Last commit message	Last commit date
Latest commit History 151 Commits
Documentation		Documentation
Notes and Research		Notes and Research
Project Deliverables		Project Deliverables
Status Reports		Status Reports
Turing test tool		Turing test tool
Web Tool		Web Tool
anonymized prostate consult notes		anonymized prostate consult notes
src		src
synthetic note generator		synthetic note generator
.gitignore		.gitignore
README.md		README.md
SyntheticNoteCapstone.docx		SyntheticNoteCapstone.docx

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CS-25-334 Large Language Models (LLM) for data extraction from clinical notes

About the project

Project Team

About

Releases

Packages

Languages

VCU-CS-Capstone/CS-25-334-llms-for-clinical-notes

Folders and files

Latest commit

History

Repository files navigation

CS-25-334 Large Language Models (LLM) for data extraction from clinical notes

About the project

Project Team

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages