A Streamlit application for Named Entity Recognition (NER) in French medical texts using a fine-tuned DrBERT model.
This project provides a web interface for extracting medical entities from French clinical text using a transformer model fine-tuned on the QUAERO medical corpus. The application supports real-time entity extraction and provides interactive visualizations of the results.
the ner_project.ipynb notebook demonstrates the development and evaluation of various approaches, from traditional machine learning to transformer-based models.
- 🔍 Real-time medical entity extraction
- 📊 Interactive visualizations:
- Entity distribution charts
- Confidence score distributions
- Entity cards with confidence indicators
- 📥 Export results to CSV
- 🎯 Entity types supported:
- DISO (Disorders)
- PROC (Procedures)
- ANAT (Anatomical structures)
- CHEM (Chemicals & Drugs)
- And more...
- Install required packages:
pip install -r requirements.txt- Run the Streamlit app:
streamlit run app.py-
Open the application in your web browser
-
Enter or paste French medical text in the input area
-
Click "Extract Entities" to analyze the text
-
View results in different visualization modes:
- Entity Cards
- Entity Distribution
-
Download results as CSV if needed
- Base model: DrBERT (French medical BERT)
- Fine-tuned on: QUAERO medical corpus
- Hosted on: Hugging Face Hub
- Model ID:
abdel132/ner-drbert-quaero
- streamlit
- transformers
- pandas
- plotly
- torch