Skip to content

A vector database system for the International Classification of Diseases (ICD) and International Classification of Health Interventions (ICHI)

License

Notifications You must be signed in to change notification settings

SEBK4C/ICD-DB-Vec-NLP-Student-Project

Repository files navigation

ICD Vectorization for Open Doctor - NLP Student Project

Project Overview

This project aims to develop a vector database and search system for the International Classification of Diseases (ICD-11) database, enabling natural language querying for medical conditions and appropriate interventions. This work is part of the larger Open Doctor project, which provides AI-powered medical assistance while maintaining patient privacy through local processing.

Project Documents

Core Documentation

Student Role Documents

Project Deadline

The final project submission is due on May 25, 2025.

Getting Started

  1. Review the Project Overview to understand the overall goals
  2. Read your specific role document to understand your responsibilities
  3. Study the Technical Architecture for system design details
  4. Look at the Sample Code for implementation starting points
  5. Participate in the first team meeting to discuss approach and assignments

Core Technologies

  • Python: Primary programming language
  • FastAPI: API framework
  • Vector Databases: FAISS, Pinecone, or similar
  • Sentence Transformers: For creating embeddings
  • ICD-11 API: WHO's International Classification of Diseases API (Documentation, API Reference)
  • ICHI Browser: WHO's International Classification of Health Interventions (Browser)
  • GitHub: For code storage and collaboration
  • Docker: For containerization and deployment

Resources

Communication

Team members should establish regular communication channels and meeting schedules. All code and documentation should be maintained in a shared GitHub repository to facilitate collaboration and version control.

For questions about the Open Doctor project or technical guidance, contact SEBK4C.


This project will enable the Open Doctor system to translate natural language descriptions of medical symptoms into standardized ICD codes and suggest appropriate medical interventions, forming a critical component of the broader goal to democratize access to medical knowledge while maintaining patient privacy.

About

A vector database system for the International Classification of Diseases (ICD) and International Classification of Health Interventions (ICHI)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published