Spaces:

shegga
/

SentimentAnalysisForNMTTNT

Runtime error

App Files Files Community

SentimentAnalysisForNMTTNT / deploy_package /README.md

shegga

🔧 Fix README configuration for Hugging Face Spaces

ab92e28 about 1 month ago

preview code

raw

history blame contribute delete

4.52 kB

A newer version of the Gradio SDK is available: 5.49.1

Upgrade

metadata

title: Vietnamese Sentiment Analysis
emoji: 🎭
colorFrom: green
colorTo: blue
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false

🎭 Vietnamese Sentiment Analysis

A Vietnamese sentiment analysis web interface built with Gradio and transformer models, optimized for Hugging Face Spaces deployment.

🚀 Features

🤖 Transformer-based Model: Uses 5CD-AI/Vietnamese-Sentiment-visobert from Hugging Face Hub
🌐 Interactive Web Interface: Real-time sentiment analysis via Gradio
⚡ Memory Efficient: Built-in memory management and batch processing limits
📊 Visual Analysis: Confidence scores with interactive charts
📝 Batch Processing: Analyze multiple texts at once
🛡️ Memory Management: Real-time memory monitoring and cleanup

🎯 Usage

Single Text Analysis

Enter Vietnamese text in the input field
Click "Analyze Sentiment"
View the sentiment prediction with confidence scores
See probability distribution in the chart

Batch Analysis

Switch to "Batch Analysis" tab
Enter multiple Vietnamese texts (one per line)
Click "Analyze All" to process all texts
View comprehensive batch summary with sentiment distribution

Memory Management

Monitor real-time memory usage
Use "Memory Cleanup" button if needed
Automatic cleanup after each prediction
Maximum 10 texts per batch for efficiency

📊 Model Details

Model: 5CD-AI/Vietnamese-Sentiment-visobert
Architecture: Transformer-based (XLM-RoBERTa)
Language: Vietnamese
Labels: Negative, Neutral, Positive
Max Sequence Length: 512 tokens
Device: Automatic CUDA/CPU detection

💡 Example Usage

Try these example Vietnamese texts:

"Giảng viên dạy rất hay và tâm huyết." (Positive)
"Môn học này quá khó và nhàm chán." (Negative)
"Lớp học ổn định, không có gì đặc biệt." (Neutral)

🛠️ Technical Features

Memory Optimization

Automatic GPU cache clearing
Garbage collection management
Memory usage monitoring
Batch size limits
Real-time memory tracking

Performance

~100ms processing time per text
Supports up to 512 token sequences
Efficient batch processing
Memory limit: 8GB (Hugging Face Spaces)

📋 Model Performance

The model provides:

Sentiment Classification: Positive, Neutral, Negative
Confidence Scores: Probability distribution across classes
Real-time Processing: Fast inference on CPU/GPU
Batch Analysis: Efficient processing of multiple texts

🔧 Deployment

This Space is configured for Hugging Face Spaces with:

SDK: Gradio 4.44.0
Hardware: CPU (with CUDA support if available)
Memory: 8GB limit with optimization
Model Loading: Direct from Hugging Face Hub

📄 Requirements

See requirements.txt for complete dependency list:

torch>=2.0.0
transformers>=4.21.0
gradio>=4.44.0
pandas, numpy, scikit-learn
psutil for memory monitoring

🎯 Use Cases

Education: Analyze student feedback
Customer Service: Analyze customer reviews
Social Media: Monitor sentiment in posts
Research: Vietnamese text analysis
Business: Customer sentiment tracking

🔍 Troubleshooting

Memory Issues

Use "Memory Cleanup" button
Reduce batch size
Refresh the page if needed

Model Loading

Model loads automatically from Hugging Face Hub
No local training required
Automatic fallback to CPU if GPU unavailable

Performance Tips

Clear, grammatically correct Vietnamese text works best
Longer texts (20-200 words) provide better context
Use batch processing for multiple texts

📝 Citation

If you use this model or Space, please cite the original model:

@InProceedings{8573337,
  author={Nguyen, Kiet Van and Nguyen, Vu Duc and Nguyen, Phu X. V. and Truong, Tham T. H. and Nguyen, Ngan Luu-Thuy},
  booktitle={2018 10th International Conference on Knowledge and Systems Engineering (KSE)},
  title={UIT-VSFC: Vietnamese Students' Feedback Corpus for Sentiment Analysis},
  year={2018},
  volume={},
  number={},
  pages={19-24},
  doi={10.1109/KSE.2018.8573337}
}

🤝 Contributing

Feel free to:

Submit issues and feedback
Suggest improvements
Report bugs
Request new features

📄 License

This Space uses open-source components under MIT license.

Try it now! Enter some Vietnamese text above to see the sentiment analysis in action. 🎭