Multi-Task Learning with NLP

A comprehensive multi-task learning system for Natural Language Processing that simultaneously detects emotions, violence, and hate speech in text data.

🎯 Project Overview

This project implements a deep learning model that can perform three different NLP tasks simultaneously:

Emotion Detection: Classifies text into 6 emotions (sadness, joy, love, anger, fear, surprise)
Violence Detection: Identifies 5 types of violence (sexual, physical, emotional, harmful traditional practices, economic)
Hate Speech Detection: Detects offensive speech, neutral content, and hate speech

🚀 Features

Multi-Task Learning: Single model handles multiple NLP tasks efficiently
Web Interface: User-friendly Flask web application
Real-time Prediction: Instant text analysis with confidence scores
Comprehensive Evaluation: Detailed metrics and visualizations
Modular Architecture: Well-organized, maintainable code structure

📁 Project Structure

├── dataset_load.py          # Dataset loading and preprocessing
├── data_preprocessing.py    # Text cleaning and tokenization
├── model.py                # Multi-task neural network architecture
├── train.py                # Training pipeline
├── test.py                 # Testing and prediction utilities
├── evaluate.py             # Model evaluation and metrics
├── utils.py                # Utility functions
├── config.py               # Configuration settings
├── app.py                  # Flask web application
├── templates/
│   └── index.html          # Web interface
├── requirements.txt        # Python dependencies
└── README.md              # This file

🛠️ Installation

Clone the repository:

git clone https://github.com/uzzal2200/NLP-Multi-Task-Learning-System.git
cd multi-task-nlp-project

Install dependencies:
```
pip install -r requirements.txt
```

Download NLTK data (if not already downloaded):

import nltk
nltk.download('punkt')
nltk.download('stopwords')

📊 Dataset Requirements

Place your datasets in the following structure:

Dataset/
├── Emotion/
│   └── text.csv
├── Violence/
│   └── Train.csv
└── Hate/
    └── labeled_data.csv

Dataset Formats:

Emotion Dataset: Columns should include 'text' and 'label' (0-5)
Violence Dataset: Columns should include 'tweet' and 'type'
Hate Dataset: Columns should include 'tweet' and 'class'

🏃‍♂️ Quick Start

1. Training the Model

python train.py

This will:

Load and preprocess all datasets
Create and train the multi-task model
Save the trained model to Save model/multi_task_model.h5
Save the tokenizer to Save model/tokenizer.pkl

2. Running the Web Application

python app.py

Then open your browser and go to: http://localhost:5000

3. Testing Individual Components

# Test dataset loading
python dataset_load.py

# Test data preprocessing
python data_preprocessing.py

# Test model creation
python model.py

# Test evaluation
python evaluate.py

🎮 Usage Examples

Web Interface

Open the web application
Enter text in the input field
Click "Analyze Text"
View results for all three tasks

Programmatic Usage

from test import MultiTaskPredictor
from utils import load_tokenizer

# Load model and tokenizer
tokenizer = load_tokenizer('Save model/tokenizer.pkl')
predictor = MultiTaskPredictor('Save model/multi_task_model.h5', tokenizer)

# Make predictions
text = "I am so happy today!"
result = predictor.predict_single(text)
print(f"Major Task: {result['major_task']}")
print(f"Recommended Label: {result['recommended_label']}")
print(f"Confidence: {result['recommended_confidence']:.4f}")

📈 Model Architecture

The multi-task model consists of:

Shared Embedding Layer: 128-dimensional word embeddings
Shared LSTM Layer: 64 units with return sequences
Shared Pooling: Global average pooling
Task-Specific Outputs: Dense layers for each task
- Emotion: 6 classes (softmax)
- Violence: 5 classes (softmax)
- Hate: 3 classes (softmax)

🔧 Configuration

Edit config.py to customize:

Dataset paths
Model hyperparameters
Training parameters
Evaluation settings

📊 Evaluation Metrics

The system provides comprehensive evaluation including:

Accuracy: Overall classification accuracy
Precision: Precision for each class
Recall: Recall for each class
F1-Score: Harmonic mean of precision and recall
Confusion Matrices: Visual representation of predictions

🌐 API Endpoints

The Flask application provides these endpoints:

GET /: Main web interface
POST /predict: Text prediction API
GET /health: Health check
GET /model_info: Model information
GET /example_predictions: Example predictions

API Usage Example

curl -X POST http://localhost:5000/predict \
  -H "Content-Type: application/json" \
  -d '{"text": "I am so happy today!"}'

🎯 Performance

Typical performance metrics:

Emotion Detection: ~85-90% accuracy
Violence Detection: ~80-85% accuracy
Hate Speech Detection: ~85-90% accuracy

Note: Performance may vary based on dataset quality and training parameters

🛠️ Customization

Adding New Tasks

Modify model.py to add new output layers
Update config.py with new task configuration
Modify data loading in dataset_load.py
Update evaluation metrics in evaluate.py

Changing Model Architecture

Edit the model creation function in model.py
Adjust hyperparameters in config.py
Retrain the model using train.py

🐛 Troubleshooting

Common Issues:

Model not found error:
- Ensure you've run train.py first
- Check that Save model/ directory exists
Tokenizer not found error:
- The app will auto-generate tokenizer on first run
- Or run train.py to create it manually
Dataset path errors:
- Verify dataset paths in config.py
- Ensure CSV files exist and are readable
Memory issues:
- Reduce batch size in config.py
- Use smaller max_length for sequences

📝 License

This project is open source and available under the MIT License.

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

📞 Support

If you encounter any issues or have questions:

Check the troubleshooting section above
Review the code comments for guidance
Open an issue on the repository

🔄 Version History

v1.0.0: Initial release with multi-task learning implementation
v1.1.0: Added web interface and API endpoints
v1.2.0: Enhanced evaluation and visualization features

Happy Coding! 🚀

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi-Task Learning with NLP

🎯 Project Overview

🚀 Features

📁 Project Structure

🛠️ Installation

📊 Dataset Requirements

Dataset Formats:

🏃‍♂️ Quick Start

1. Training the Model

2. Running the Web Application

3. Testing Individual Components

🎮 Usage Examples

Web Interface

Programmatic Usage

📈 Model Architecture

🔧 Configuration

📊 Evaluation Metrics

🌐 API Endpoints

API Usage Example

🎯 Performance

🛠️ Customization

Adding New Tasks

Changing Model Architecture

🐛 Troubleshooting

Common Issues:

📝 License

🤝 Contributing

📞 Support

🔄 Version History

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Dataset		Dataset
Notebook		Notebook
Save model		Save model
__pycache__		__pycache__
templates		templates
.gitignore		.gitignore
Procfile		Procfile
README.md		README.md
app.py		app.py
config.py		config.py
data_preprocessing.py		data_preprocessing.py
dataset_load.py		dataset_load.py
deploy_helper.py		deploy_helper.py
deploy_pythonanywhere.py		deploy_pythonanywhere.py
evaluate.py		evaluate.py
model.py		model.py
railway.toml		railway.toml
render.yaml		render.yaml
requirements.txt		requirements.txt
runtime.txt		runtime.txt
test.py		test.py
train.py		train.py
utils.py		utils.py

uzzal2200/NLP-Multi-Task-Learning-System

Folders and files

Latest commit

History

Repository files navigation

Multi-Task Learning with NLP

🎯 Project Overview

🚀 Features

📁 Project Structure

🛠️ Installation

📊 Dataset Requirements

Dataset Formats:

🏃‍♂️ Quick Start

1. Training the Model

2. Running the Web Application

3. Testing Individual Components

🎮 Usage Examples

Web Interface

Programmatic Usage

📈 Model Architecture

🔧 Configuration

📊 Evaluation Metrics

🌐 API Endpoints

API Usage Example

🎯 Performance

🛠️ Customization

Adding New Tasks

Changing Model Architecture

🐛 Troubleshooting

Common Issues:

📝 License

🤝 Contributing

📞 Support

🔄 Version History

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages