Space: HackathonCRA/mcp

Tracy André committed on Sep 18

Commit

dc128e4

1 Parent(s): 7cf64b9

updated

Files changed (17)

DATASET_CARD.md +0 -280
IMPLEMENTATION_SUMMARY.md +0 -202
MODEL_CARD.md +0 -296
GOAL.md → PROMPT.md +36 -1
README.md +86 -157
analysis_tools.py +0 -368
app.py +13 -224
demo.py +0 -218
gradio_app.py +0 -474
hf_integration.py +0 -313
hf_usage_example.py +0 -214
launch.py +0 -170
mcp.code-workspace +0 -11
mcp_server.py +267 -404
requirements.txt +5 -7
test_data_sources.py +0 -190
test_hf_only.py +0 -155

DATASET_CARD.md DELETED

@@ -1,280 +0,0 @@
----
-license: cc-by-4.0
-task_categories:
-- tabular-regression
-- time-series-forecasting
-language:
-- fr
-tags:
-- agriculture
-- herbicides
-- weed-pressure
-- crop-rotation
-- france
-- bretagne
-- sustainability
-- precision-agriculture
-- ift
-- treatment-frequency-index
-size_categories:
-- 1K<n<10K
-pretty_name: "Station Expérimentale de Kerguéhennec - Agricultural Interventions"
-configs:
-- config_name: default
-  data_files:
-  - split: train
-    path: "*.csv"
----
-# 🚜 Station Expérimentale de Kerguéhennec - Agricultural Interventions Dataset
-## Dataset Description
-This dataset contains comprehensive agricultural intervention records from the Station Expérimentale de Kerguéhennec in Brittany, France, spanning from 2014 to 2024. The data provides detailed insights into agricultural practices, crop rotations, herbicide treatments, and field management operations across 100 different plots.
-## Dataset Summary
-- **Source**: Station Expérimentale de Kerguéhennec, Brittany, France
-- **Time Period**: 2014-2024 (10 years)
-- **Location**: Brittany (Bretagne), France
-- **Records**: 4,663 intervention records
-- **Plots**: 100 unique agricultural parcels
-- **Crops**: 42 different crop types
-- **Format**: CSV exports from farm management system
-- **Language**: French (field names and crop types)
-## Primary Use Cases
-This dataset is particularly valuable for:
-1. **🌿 Weed Pressure Analysis**: Calculate and predict Treatment Frequency Index (IFT) for herbicides
-2. **🔄 Crop Rotation Optimization**: Analyze the impact of different crop sequences on pest pressure
-3. **🌱 Sustainable Agriculture**: Support reduction of herbicide use while maintaining productivity
-4. **🎯 Precision Agriculture**: Identify suitable plots for sensitive crops (peas, beans)
-5. **📊 Agricultural Research**: Study relationships between farming practices and outcomes
-6. **🤖 Machine Learning**: Train models for agricultural prediction and decision support
-## Data Structure
-### Core Fields
-| Field | Description | Type | Example |
-|-------|-------------|------|---------|
-| `millesime` | Year of intervention | Integer | 2024 |
-| `nomparc` | Plot/field name | String | "Etang Milieu" |
-| `surfparc` | Plot surface area (hectares) | Float | 2.28 |
-| `libelleusag` | Crop type/usage | String | "pois de conserve" |
-| `datedebut` | Intervention start date | Date | "20/2/24" |
-| `datefin` | Intervention end date | Date | "20/2/24" |
-| `libevenem` | Intervention type | String | "Semis classique" |
-| `familleprod` | Product family | String | "Herbicides" |
-| `produit` | Specific product used | String | "CALLISTO" |
-| `quantitetot` | Total quantity applied | Float | 1.5 |
-| `unite` | Unit of measurement | String | "L" |
-### Derived Fields (Added During Processing)
-| Field | Description | Type |
-|-------|-------------|------|
-| `year` | Standardized year | Integer |
-| `crop_type` | Standardized crop classification | String |
-| `is_herbicide` | Boolean flag for herbicide treatments | Boolean |
-| `is_fungicide` | Boolean flag for fungicide treatments | Boolean |
-| `is_insecticide` | Boolean flag for insecticide treatments | Boolean |
-| `plot_name` | Standardized plot name | String |
-| `intervention_type` | Standardized intervention classification | String |
-## Key Statistics
-### Temporal Coverage
-- **Years**: 2014-2024 (missing 2017 due to data format issues)
-- **Seasons**: All agricultural seasons represented
-- **Frequency**: Multiple interventions per plot per year
-### Spatial Coverage
-- **Plots**: 100 unique agricultural parcels
-- **Surface**: Variable plot sizes (0.43 to 5+ hectares)
-- **Location**: Single experimental station (controlled conditions)
-### Intervention Types
-- **Herbicide applications**: 800+ treatments
-- **Total interventions**: 4,663 records
-- **Product families**: Herbicides, Fungicides, Insecticides, Fertilizers
-- **Most common crops**: Wheat, Corn, Rapeseed
-## Treatment Frequency Index (IFT)
-### Definition
-The IFT (Indice de Fréquence de Traitement) is a key metric calculated as:
-```
-IFT = Number of applications / Plot surface area
-```
-### Interpretation
-- **IFT < 1.0**: Low weed pressure (suitable for sensitive crops)
-- **IFT 1.0-2.0**: Moderate pressure (monitoring required)
-- **IFT > 2.0**: High pressure (intervention needed)
-### Dataset Statistics
-- **Mean IFT**: 1.93 (moderate pressure)
-- **Range**: 0.14 - 6.67
-- **Trend**: Decreasing from 2.91 (2014) to 1.74 (2024)
-## Data Quality
-### Completeness
-- **Core fields**: 95%+ completeness for essential variables
-- **Date fields**: Well-formatted and consistent
-- **Numeric fields**: Validated ranges and units
-- **Geographic data**: Anonymized but consistent plot identifiers
-### Validation
-- **Cross-references**: Product codes validated against official databases
-- **Temporal consistency**: Logical intervention sequences
-- **Agronomic validity**: Realistic crop rotations and treatment patterns
-### Limitations
-- **Geographic scope**: Single experimental station (limited geographic diversity)
-- **Weather data**: Not included (external source required)
-- **Economic data**: Treatment costs not provided
-- **Soil characteristics**: Limited soil type information
-## Ethical Considerations
-### Privacy Protection
-- **Location data**: Generalized to protect farm location
-- **Personal information**: All farmer identifying data removed
-- **Commercial sensitivity**: Product usage patterns aggregated when appropriate
-### Bias Considerations
-- **Geographic bias**: Limited to Brittany region
-- **Temporal bias**: Recent years may have different practices
-- **Selection bias**: Experimental station may not represent typical farms
-- **Technology bias**: Practices may reflect research station capabilities
-## Applications
-### 1. Weed Pressure Prediction
-Use machine learning models to predict future IFT values based on:
-- Historical treatment patterns
-- Crop rotation sequences
-- Environmental factors
-- Plot characteristics
-**Example Model Performance**:
-- Random Forest Regressor: R² = 0.65-0.85
-- Features: Year, plot surface, previous IFT, crop type, rotation sequence
-### 2. Sustainable Plot Selection
-Identify plots suitable for sensitive crops (peas, beans) by:
-- Analyzing historical IFT trends
-- Evaluating rotation impacts
-- Assessing risk levels for future years
-### 3. Crop Rotation Optimization
-Optimize rotation sequences through:
-- Impact analysis of different crop sequences
-- Identification of beneficial rotations
-- Risk assessment for specific transitions
-**Best Rotations (Lowest IFT)**:
-1. Peas → Rapeseed: IFT 0.62
-2. Winter Barley → Rapeseed: IFT 0.64
-3. Corn → Spring Barley: IFT 0.69
-### 4. Herbicide Alternative Analysis
-Support reduction strategies through:
-- Product usage pattern analysis
-- Temporal trend identification
-- Alternative strategy development
-## Code Examples
-### Loading the Dataset
-```python
-from datasets import load_dataset
-# Load the dataset
-dataset = load_dataset("HackathonCRA/2024")
-# Convert to pandas for analysis
-import pandas as pd
-df = dataset["train"].to_pandas()
-print(f"Loaded {len(df)} intervention records")
-print(f"Covering {df['year'].nunique()} years")
-```
-### Calculate IFT
-```python
-# Calculate IFT for herbicide applications
-herbicides = df[df['familleprod'].str.contains('Herbicides', na=False)]
-ift_data = herbicides.groupby(['plot_name', 'year', 'crop_type']).agg({
-    'quantitetot': 'sum',
-    'produit': 'count',  # Number of applications
-    'surfparc': 'first'
-}).reset_index()
-ift_data['ift'] = ift_data['produit'] / ift_data['surfparc']
-```
-### Analyze Crop Rotations
-```python
-# Create rotation sequences
-rotations = []
-for plot in df['plot_name'].unique():
-    plot_data = df[df['plot_name'] == plot].sort_values('year')
-    crops = plot_data.groupby('year')['crop_type'].first()
-    for i in range(len(crops)-1):
-        rotation = f"{crops.iloc[i]} → {crops.iloc[i+1]}"
-        rotations.append({
-            'plot': plot,
-            'year_from': crops.index[i],
-            'year_to': crops.index[i+1],
-            'rotation': rotation
-        })
-rotation_df = pd.DataFrame(rotations)
-```
-## Related Datasets
-- **Weather Data**: Consider integrating with Météo-France data for enhanced analysis
-- **Soil Data**: European Soil Database for soil type information
-- **Economic Data**: Agricultural input cost databases
-- **Regulatory Data**: AMM (Marketing Authorization) product databases
-## Citation
-If you use this dataset in your research, please cite:
-```bibtex
-@dataset{hackathon_cra_2024,
-  title={Station Expérimentale de Kerguéhennec Agricultural Interventions Dataset},
-  author={Hackathon CRA Team},
-  year={2024},
-  publisher={Hugging Face},
-  url={https://huggingface.co/datasets/HackathonCRA/2024},
-  note={Agricultural intervention data from Brittany, France (2014-2024)}
-}
-```
-## License
-This dataset is released under CC-BY-4.0 license, allowing for both commercial and research use with proper attribution.
-## Updates and Versioning
-- **Version 1.0**: Initial release with 2014-2024 data
-- **Future versions**: May include additional years or enhanced metadata
-- **Quality improvements**: Ongoing validation and cleaning
-## Contact
-For questions about this dataset, collaboration opportunities, or data corrections, please use the Hugging Face dataset discussion feature or contact the research team through the repository.
----
-**Keywords**: agriculture, herbicides, crop rotation, sustainable farming, France, Brittany, IFT, weed management, precision agriculture, time series, regression, treatment frequency
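
Note on the deleted card: its simplified IFT formula (applications divided by plot surface) and its three interpretation thresholds can be sketched in a few lines. This is a minimal illustration under those stated definitions; the function names `compute_ift` and `classify_ift` are hypothetical and do not come from the repository.

```python
# Minimal sketch of the card's simplified IFT formula and thresholds.
# Function names are hypothetical, not taken from the repository.

def compute_ift(n_applications: int, surface_ha: float) -> float:
    """IFT as defined in the card: applications divided by plot surface (ha)."""
    if surface_ha <= 0:
        raise ValueError("surface_ha must be positive")
    return n_applications / surface_ha


def classify_ift(ift: float) -> str:
    """Map an IFT value to the card's three pressure levels."""
    if ift < 1.0:
        return "low"
    if ift <= 2.0:
        return "moderate"
    return "high"


# Example: 3 herbicide applications on a 2.28 ha plot.
ift = compute_ift(3, 2.28)
print(f"IFT = {ift:.2f} -> {classify_ift(ift)}")
```

The card leaves the boundary at exactly 2.0 ambiguous; the sketch assumes it falls in the moderate band.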

IMPLEMENTATION_SUMMARY.md DELETED

@@ -1,202 +0,0 @@
-# 🚜 Agricultural Analysis Tool - Implementation Summary
-## ✅ Successfully Implemented
-### 🎯 Project Objectives - COMPLETED
-- ✅ **Weed pressure prediction** for next 3 years using machine learning
-- ✅ **Plot identification** for sensitive crops (peas, beans)
-- ✅ **IFT analysis** (Treatment Frequency Index) for herbicide usage
-- ✅ **Crop rotation impact** analysis on weed pressure
-- ✅ **Historical data integration** from Station Expérimentale de Kerguéhennec (2014-2024)
-- ✅ **Herbicide alternative analysis** and usage patterns
-### 🏗️ Technical Architecture - COMPLETED
-#### 1. **MCP Server** (`mcp_server.py`)
-- ✅ Model Context Protocol compliant server
-- ✅ 7 tools for data analysis and filtering
-- ✅ 6 resources for data access
-- ✅ JSON-based responses for LLM integration
-- ✅ Error handling and logging
-#### 2. **Data Processing** (`data_loader.py`)
-- ✅ Loads 10+ CSV/Excel files automatically
-- ✅ Handles mixed data formats (CSV + Excel)
-- ✅ Data preprocessing and cleaning
-- ✅ Derived metrics calculation (IFT, crop types, etc.)
-- ✅ Caching for performance
-#### 3. **Analysis Engine** (`analysis_tools.py`)
-- ✅ Statistical analysis of intervention data
-- ✅ Random Forest prediction model for weed pressure
-- ✅ Interactive Plotly visualizations
-- ✅ Crop rotation sequence analysis
-- ✅ Risk level classification (low/medium/high)
-#### 4. **Gradio Interface** (`gradio_app.py`)
-- ✅ 6-tab interactive web interface
-- ✅ Real-time filtering and analysis
-- ✅ Interactive plots and visualizations
-- ✅ Export capabilities
-- ✅ User-friendly French interface
-#### 5. **Hugging Face Integration** (`hf_integration.py`, `app.py`)
-- ✅ HF Spaces deployment configuration
-- ✅ Dataset upload functionality
-- ✅ Environment variable management
-- ✅ Production-ready app entry point
-### 📊 Data Analysis Results
-#### **Dataset Statistics**
-- **Records processed**: 4,663 interventions
-- **Time period**: 2014-2024 (10 years)
-- **Plots analyzed**: 100 unique parcels
-- **Crop types**: 42 different crops
-- **Herbicide applications**: 800+ treatments
-#### **Key Findings**
-- **Average IFT**: 1.93 (moderate weed pressure)
-- **IFT trends**: Decreasing from 2.91 (2014) to 1.74 (2024)
-- **Best rotations**: pois → colza (IFT: 0.62), orge → colza (IFT: 0.64)
-- **Worst rotations**: colza → triticale (IFT: 2.79)
-- **Top herbicides**: BISCOTO, CALLISTO, PRIMUS
-### 🔧 Tools and Features
-#### **MCP Tools Available**
-1. `filter_data` - Filter by years, plots, crops, interventions
-2. `analyze_weed_pressure` - IFT analysis with visualizations
-3. `predict_weed_pressure` - ML predictions for 2025-2027
-4. `identify_suitable_plots` - Find plots for sensitive crops
-5. `analyze_crop_rotation` - Rotation impact analysis
-6. `analyze_herbicide_alternatives` - Product usage patterns
-7. `get_data_statistics` - Comprehensive data summaries
-#### **Gradio Interface Tabs**
-1. **📊 Aperçu** - Data overview and statistics
-2. **🔍 Filtrage** - Interactive data filtering
-3. **🌿 Pression Adventices** - Weed pressure analysis
-4. **🔮 Prédictions** - ML-based predictions
-5. **🔄 Rotations** - Crop rotation analysis
-6. **💊 Herbicides** - Product usage analysis
-### 🚀 Deployment Options
-#### **Local Development**
-```bash
-# Quick start
-python launch.py
-# Individual components
-python gradio_app.py    # Web interface
-python mcp_server.py    # MCP server
-python demo.py          # Demo script
-```
-#### **Hugging Face Spaces**
-```bash
-python app.py  # HF-compatible launcher
-```
-#### **Docker/Cloud**
-- All dependencies in `requirements.txt`
-- Environment variables configured
-- Production-ready settings
-### 📈 Performance Metrics
-#### **Model Performance**
-- **R² Score**: 0.65-0.85 (varies by data split)
-- **Prediction accuracy**: Good for identifying trends
-- **Processing speed**: < 2 seconds for full analysis
-- **Memory usage**: < 500MB for full dataset
-#### **System Performance**
-- **Data loading**: < 5 seconds for all files
-- **Analysis completion**: < 10 seconds
-- **Visualization generation**: < 3 seconds
-- **Web interface response**: < 1 second
-### 🎯 Business Impact
-#### **For Farmers**
-- ✅ **Reduced herbicide usage** through targeted application
-- ✅ **Optimized crop placement** on suitable plots
-- ✅ **Improved rotation planning** based on data insights
-- ✅ **Risk assessment** for sensitive crops
-#### **For Agricultural Advisors**
-- ✅ **Data-driven recommendations** with historical backing
-- ✅ **Visual analysis tools** for client presentations
-- ✅ **Comparative analysis** across plots and years
-- ✅ **Regulatory compliance** tracking (IFT monitoring)
-#### **For Researchers**
-- ✅ **Comprehensive dataset** for further research
-- ✅ **Reproducible analysis** methods
-- ✅ **ML model** for extension to other regions
-- ✅ **Open source tools** for collaboration
-### 🌍 Environmental Benefits
-- **Herbicide reduction**: Targeted application reduces overall usage
-- **Biodiversity protection**: Lower chemical pressure on ecosystems
-- **Soil health**: Optimized rotations improve soil structure
-- **Water quality**: Reduced runoff from excess treatments
-### 📋 Next Steps and Extensions
-#### **Immediate Enhancements**
-1. **Weather data integration** for improved predictions
-2. **Soil type classification** for more precise recommendations
-3. **Economic analysis** (cost vs. benefit of treatments)
-4. **Mobile app development** for field use
-#### **Advanced Features**
-1. **Real-time monitoring** with IoT sensors
-2. **Satellite imagery** integration for precision agriculture
-3. **AI-powered recommendations** using large language models
-4. **Multi-farm analysis** for regional insights
-#### **Research Opportunities**
-1. **Climate change impact** modeling
-2. **Resistance development** tracking
-3. **Biodiversity indicators** integration
-4. **Carbon footprint** assessment
-## 🏆 Project Success Metrics
-### ✅ All Objectives Met
-- **Functional MCP Server**: ✅ 100% operational
-- **Gradio Interface**: ✅ Fully interactive
-- **Data Analysis**: ✅ Comprehensive insights
-- **Prediction Model**: ✅ Working with good accuracy
-- **HF Compatibility**: ✅ Ready for deployment
-- **Documentation**: ✅ Complete with examples
-### 📊 Technical Achievements
-- **Code Quality**: Clean, modular, well-documented
-- **Performance**: Fast, efficient, scalable
-- **User Experience**: Intuitive, visual, informative
-- **Deployment**: Multiple options, production-ready
-### 🎯 Business Value
-- **Actionable Insights**: Clear recommendations for farmers
-- **Cost Reduction**: Optimized herbicide usage
-- **Risk Mitigation**: Better crop placement decisions
-- **Compliance**: IFT tracking for regulations
----
-## 🚀 Ready for Production
-The Agricultural Analysis Tool is **production-ready** with:
-- ✅ **Stable codebase** with error handling
-- ✅ **Comprehensive testing** via demo script
-- ✅ **Multiple deployment options** (local, cloud, HF)
-- ✅ **Complete documentation** and examples
-- ✅ **Scalable architecture** for future enhancements
-**🎉 Project completed successfully for the CRA Hackathon!**
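
Note on the rotation findings in the deleted summary: the summary does not say how an IFT is attributed to a rotation. One plausible reading, assumed here, is that each "previous → next" transition is scored by the IFT observed on the following crop, averaged over plots. The data and the `rotation_ift` helper below are illustrative only and are not taken from the repository.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical per-plot history: (year, crop, herbicide IFT). Values are illustrative only.
history = {
    "plot_A": [(2021, "pois", 0.9), (2022, "colza", 0.6), (2023, "triticale", 2.8)],
    "plot_B": [(2021, "orge", 1.1), (2022, "colza", 0.7), (2023, "triticale", 2.6)],
}


def rotation_ift(history):
    """Average the IFT of the following crop for each 'previous → next' transition."""
    by_rotation = defaultdict(list)
    for records in history.values():
        ordered = sorted(records)  # tuples sort by year first
        for (_, prev_crop, _), (_, next_crop, next_ift) in zip(ordered, ordered[1:]):
            by_rotation[f"{prev_crop} → {next_crop}"].append(next_ift)
    return {rotation: mean(values) for rotation, values in by_rotation.items()}


# Rank rotations from lowest to highest average IFT.
ranking = sorted(rotation_ift(history).items(), key=lambda item: item[1])
for rotation, avg in ranking:
    print(f"{rotation}: {avg:.2f}")
```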

MODEL_CARD.md DELETED

@@ -1,296 +0,0 @@
----
-license: cc-by-4.0
-library_name: scikit-learn
-pipeline_tag: tabular-regression
-tags:
-- agriculture
-- herbicides
-- weed-pressure
-- crop-rotation
-- time-series-forecasting
-- sustainability
-- random-forest
-datasets:
-- HackathonCRA/2024
-language:
-- fr
-base_model: null
-model-index:
-- name: Agricultural Weed Pressure Predictor
-  results:
-  - task:
-      type: tabular-regression
-      name: Treatment Frequency Index Prediction
-    dataset:
-      name: Station Expérimentale de Kerguéhennec
-      type: HackathonCRA/2024
-    metrics:
-    - name: R² Score
-      type: r2_score
-      value: 0.75
-    - name: Mean Squared Error
-      type: mean_squared_error
-      value: 0.42
-    - name: Mean Absolute Error
-      type: mean_absolute_error
-      value: 0.51
----
-# 🚜 Agricultural Weed Pressure Predictor
-## Model Description
-This Random Forest regression model predicts the Treatment Frequency Index (IFT) for herbicide applications in agricultural plots, specifically designed to help farmers in Brittany, France optimize their weed management strategies and identify suitable plots for sensitive crops like peas and beans.
-## Model Details
-### Architecture
-- **Model Type**: Random Forest Regressor
-- **Framework**: scikit-learn
-- **Target Variable**: IFT (Treatment Frequency Index) for herbicides
-- **Prediction Horizon**: 1-3 years ahead (2025-2027)
-- **Input Features**: 15+ engineered features
-### Training Details
-- **Training Data**: 10 years of agricultural intervention records (2014-2024)
-- **Source**: Station Expérimentale de Kerguéhennec, Brittany, France
-- **Records**: 4,663 intervention records across 100 plots
-- **Validation**: Temporal split (train on 2014-2022, validate on 2023-2024)
-## Intended Use
-### Primary Use Cases
-1. **🎯 Plot Selection**: Identify plots suitable for sensitive crops (IFT < 1.0)
-2. **📊 Weed Pressure Forecasting**: Predict future herbicide requirements
-3. **🌱 Sustainable Agriculture**: Support herbicide reduction strategies
-4. **🔄 Rotation Planning**: Optimize crop sequences for reduced weed pressure
-### Target Users
-- **Farmers**: Decision support for crop placement and rotation planning
-- **Agricultural Advisors**: Data-driven recommendations for clients
-- **Researchers**: Analysis of farming practice impacts
-- **Policy Makers**: Assessment of sustainable agriculture initiatives
-## Model Performance
-### Evaluation Metrics
-- **R² Score**: 0.75 (explains 75% of variance in IFT)
-- **Mean Squared Error**: 0.42
-- **Mean Absolute Error**: 0.51
-- **RMSE**: 0.65
-### Performance by Risk Category
-| Risk Level | Precision | Recall | F1-Score |
-|------------|-----------|--------|----------|
-| Low (IFT < 1.0) | 0.82 | 0.78 | 0.80 |
-| Medium (1.0-2.0) | 0.71 | 0.74 | 0.72 |
-| High (IFT > 2.0) | 0.69 | 0.67 | 0.68 |
-### Feature Importance
-1. **Previous IFT** (0.35) - Historical weed pressure
-2. **Crop Type** (0.28) - Current crop being grown
-3. **Rotation Sequence** (0.18) - Previous crop type
-4. **Plot Surface** (0.12) - Size of the agricultural plot
-5. **Year Trend** (0.07) - Temporal evolution patterns
-## Features
-### Input Variables
-- **Temporal**: Year, seasonal trends
-- **Spatial**: Plot identifier, surface area
-- **Agronomic**: Current crop, previous crop, rotation type
-- **Historical**: Previous IFT values, treatment trends
-- **Derived**: Rotation sequences, trend indicators
-### Feature Engineering
-```python
-# Example feature creation
-features['prev_ift'] = grouped_data['ift'].shift(1)
-features['crop_rotation'] = prev_crop + ' → ' + current_crop
-features['ift_trend'] = features['ift'].rolling(3).apply(lambda x: np.polyfit(range(3), x, 1)[0])
-```
-## Training Procedure
-### Data Preprocessing
-1. **Temporal Aggregation**: Group interventions by plot-year-crop
-2. **IFT Calculation**: `IFT = applications / plot_surface`
-3. **Feature Engineering**: Create rotation sequences and trends
-4. **Categorical Encoding**: One-hot encoding for crops and plots
-5. **Normalization**: StandardScaler for numerical features
-### Model Training
-```python
-from sklearn.ensemble import RandomForestRegressor
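-from sklearn.model_selection import TimeSeriesSplit, cross_val_score
-model = RandomForestRegressor(
-    n_estimators=100,
-    max_depth=10,
-    min_samples_split=5,
-    min_samples_leaf=2,
-    random_state=42
-)
-# Temporal cross-validation; X_train and y_train are assumed to be prepared and ordered by year
-tscv = TimeSeriesSplit(n_splits=5)
-cv_scores = cross_val_score(model, X_train, y_train, cv=tscv, scoring="r2")
-# Final fit on the full training set
-model.fit(X_train, y_train)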
-```
-### Hyperparameters
-- **n_estimators**: 100 trees
-- **max_depth**: 10 levels
-- **min_samples_split**: 5 samples
-- **min_samples_leaf**: 2 samples
-- **random_state**: 42 (reproducibility)
-## Evaluation
-### Validation Strategy
-- **Temporal Split**: Train on 2014-2022, test on 2023-2024
-- **Cross-validation**: 5-fold time series cross-validation
-- **Holdout**: 20% of most recent data reserved for final evaluation
-### Performance Analysis
-The model performs best for:
-- ✅ **Stable rotations**: Well-established crop sequences
-- ✅ **Medium-sized plots**: 1-5 hectare plots
-- ✅ **Common crops**: Wheat, corn, rapeseed
-Challenges with:
-- ⚠️ **New crop varieties**: Limited training examples
-- ⚠️ **Extreme weather years**: Unusual climatic conditions
-- ⚠️ **Very small/large plots**: Edge cases in plot sizes
-## Limitations and Biases
-### Geographic Limitations
-- **Single Location**: Trained only on Brittany data
-- **Climate Specificity**: Oceanic climate conditions
-- **Soil Types**: Limited soil variety representation
-### Temporal Limitations
-- **Recent Data Bias**: Model may not capture long-term cycles
-- **Technology Evolution**: Changing agricultural practices over time
-- **Climate Change**: Shifting baseline conditions
-### Agricultural Limitations
-- **Experimental Station**: May not represent typical farms
-- **Crop Varieties**: Limited to varieties grown at the station
-- **Management Practices**: Research station vs. commercial practices
-### Algorithmic Biases
-- **Historical Bias**: Perpetuates past treatment patterns
-- **Sampling Bias**: Overrepresentation of certain crops/rotations
-- **Measurement Bias**: IFT calculation methodology assumptions
-## Ethical Considerations
-### Environmental Impact
-- **Positive**: Supports herbicide reduction strategies
-- **Risk**: Over-reliance on predictions might ignore local conditions
-- **Mitigation**: Always combine with expert agronomic advice
-### Economic Implications
-- **Farmers**: Could affect income through crop choice recommendations
-- **Industry**: May influence herbicide market demand
-- **Policy**: Could inform agricultural subsidy decisions
-### Responsible Use
-- **Expert Validation**: Predictions should be validated by agronomists
-- **Local Adaptation**: Model outputs need local context consideration
-- **Continuous Monitoring**: Regular model performance assessment
-## How to Use
-### Installation
-```bash
-pip install scikit-learn pandas numpy
-```
-### Basic Usage
-```python
-from analysis_tools import AgriculturalAnalyzer
-from data_loader import AgriculturalDataLoader
-# Initialize components
-data_loader = AgriculturalDataLoader()
-analyzer = AgriculturalAnalyzer(data_loader)
-# Make predictions
-predictions = analyzer.predict_weed_pressure(
-    target_years=[2025, 2026, 2027]
-)
-# Identify suitable plots
-suitable_plots = analyzer.identify_suitable_plots_for_sensitive_crops(
-    target_years=[2025, 2026, 2027],
-    max_ift_threshold=1.0
-)
-```
-### API Integration
-The model is available through the MCP (Model Context Protocol) server:
-```python
-# Via MCP server
-tool_result = await mcp_client.call_tool(
-    "predict_weed_pressure",
-    {"target_years": [2025, 2026, 2027]}
-)
-```
-## Model Updates
-### Version History
-- **v1.0**: Initial release with 2014-2024 data
-- **Future**: Regular updates with new seasonal data
-### Retraining Schedule
-- **Annual**: Incorporate new year's intervention data
-- **Seasonal**: Adjust for significant practice changes
-- **Performance-based**: Retrain when accuracy drops below threshold
-## Validation in Production
-### Monitoring Metrics
-- **Prediction Accuracy**: Compare with actual IFT values
-- **User Feedback**: Farmer success with recommendations
-- **Agronomic Validation**: Expert review of predictions
-### Performance Thresholds
-- **R² Score**: Maintain > 0.70
-- **MAE**: Keep < 0.60
-- **False Positive Rate**: < 15% for low-risk classifications
-## Carbon Footprint
-### Training Emissions
-- **Computing**: Minimal due to small dataset size (~1kg CO2)
-- **Data Storage**: Negligible impact
-- **Total Estimated**: < 2kg CO2 equivalent
-### Positive Environmental Impact
-- **Herbicide Reduction**: Potential 10-20% reduction in applications
-- **Optimized Farming**: More efficient resource use
-- **Sustainable Practices**: Support for ecological agriculture
-## Citation
-```bibtex
-@model{agricultural_weed_predictor_2024,
-  title={Agricultural Weed Pressure Predictor for Brittany Region},
-  author={Hackathon CRA Team},
-  year={2024},
-  publisher={Hugging Face},
-  url={https://huggingface.co/spaces/USERNAME/agricultural-analysis},
-  note={Random Forest model for predicting herbicide Treatment Frequency Index}
-}
-```
-## Contact
-For questions about the model, improvements, or collaboration opportunities, please use the Hugging Face Space discussions or contact the development team.
----
-**Developed for sustainable agriculture in Brittany, France** 🌱
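
Note on the deleted model card: the temporal split (train through 2022, validate on 2023-2024) and the reported metrics can be illustrated without any ML library. The sketch below is a minimal example under that assumption; the rows are made-up values and the helper names are hypothetical. It also shows why the card's RMSE of 0.65 is consistent with its MSE of 0.42, since RMSE is the square root of MSE.

```python
import math

# Hypothetical (year, actual_ift, predicted_ift) rows; the numbers are illustrative only.
rows = [
    (2021, 1.8, 1.6), (2022, 2.1, 2.0),
    (2023, 1.5, 1.7), (2023, 0.8, 1.0),
    (2024, 2.4, 2.0), (2024, 1.3, 1.3),
]


def temporal_split(rows, last_train_year=2022):
    """Keep rows up to last_train_year for training, later rows for validation."""
    train = [r for r in rows if r[0] <= last_train_year]
    valid = [r for r in rows if r[0] > last_train_year]
    return train, valid


def regression_metrics(y_true, y_pred):
    """Return MAE, MSE, RMSE and R² for paired lists of values."""
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]
    mae = sum(abs(e) for e in errors) / n
    mse = sum(e * e for e in errors) / n
    mean_true = sum(y_true) / n
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    r2 = 1 - (mse * n) / ss_tot
    return {"mae": mae, "mse": mse, "rmse": math.sqrt(mse), "r2": r2}


train, valid = temporal_split(rows)
metrics = regression_metrics([r[1] for r in valid], [r[2] for r in valid])
print(len(train), len(valid), metrics)
```

Splitting by year rather than at random keeps future seasons out of the training set, which matters when the target is a forecast.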

GOAL.md → PROMPT.md RENAMED

@@ -59,4 +59,39 @@ Design and implement an MCP server that meets the objectives above.
 Expose this server through a Gradio interface compatible with Hugging Face.
-Provide tools and resources usable by an LLM, enabling reliable, visual, and interactive analyses.
+Provide tools and resources usable by an LLM, enabling reliable, visual, and interactive analyses.
+Here is some documentation on building MCP servers with Gradio:
+- https://www.gradio.app/guides/building-mcp-server-with-gradio
+- https://huggingface.co/blog/gradio-mcp
+Here is an example of an MCP server that currently works:
+```
+import gradio as gr
+def letter_counter(word, letter):
+    """Count the occurrences of a specific letter in a word.
+    Args:
+        word: The word or phrase to analyze
+        letter: The letter to count occurrences of
+    Returns:
+        The number of times the letter appears in the word
+    """
+    return word.lower().count(letter.lower())
+demo = gr.Interface(
+    fn=letter_counter,
+    inputs=["text", "text"],
+    outputs="number",
+    title="Letter Counter",
+    description="Count how many times a letter appears in a word"
+)
+demo.launch(mcp_server=True)
+```
+Rely on this documentation to build this MCP server, keeping it as simple and efficient as possible so as to deliver a working product.
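
Note on the prompt's example: the same `gr.Interface` pattern could expose an agricultural tool instead of the letter counter. The sketch below is only an illustration; `weed_pressure_level` and `build_demo` are hypothetical names, and the thresholds are assumed from the IFT bands in the dataset card deleted by this commit. Gradio is imported lazily so the tool function itself can be run and tested without it.

```python
def weed_pressure_level(applications: float, surface_ha: float) -> str:
    """Classify herbicide weed pressure for a plot from its treatment count.

    Args:
        applications: Number of herbicide applications recorded for the plot
        surface_ha: Plot surface area in hectares

    Returns:
        The IFT value and its pressure level (low, moderate or high)
    """
    if surface_ha <= 0:
        return "surface_ha must be positive"
    ift = applications / surface_ha
    level = "low" if ift < 1.0 else "moderate" if ift <= 2.0 else "high"
    return f"IFT {ift:.2f}: {level}"


def build_demo():
    """Wrap the function in a Gradio interface (requires the gradio package)."""
    import gradio as gr  # imported lazily so the function above works without Gradio

    return gr.Interface(
        fn=weed_pressure_level,
        inputs=["number", "number"],
        outputs="text",
        title="Weed Pressure Level",
        description="Classify a plot's herbicide IFT as low, moderate or high",
    )


print(weed_pressure_level(3, 2.28))
```

To serve it as in the prompt's example, call `build_demo().launch(mcp_server=True)`; the docstring, including its `Args` section, is what the MCP client sees as the tool description.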

README.md CHANGED

@@ -1,180 +1,109 @@
----
-title: Agricultural Analysis - Kerguéhennec
-emoji: 🚜
-colorFrom: green
-colorTo: blue
-sdk: gradio
-sdk_version: 4.25.0
-app_file: app.py
-pinned: false
-license: cc-by-4.0
-language:
-- fr
-tags:
-- agriculture
-- herbicides
-- weed-pressure
-- crop-rotation
-- france
-- bretagne
-- sustainability
-- precision-agriculture
-- machine-learning
-- time-series
-datasets:
-- HackathonCRA/2024
-library_name: gradio
-pipeline_tag: tabular-regression
----
-# 🚜 Agricultural Analysis - Kerguéhennec Station
-## Overview
-Agricultural data analysis tool developed for the CRA hackathon to anticipate and reduce weed pressure on agricultural plots in Brittany. The tool analyzes historical intervention data to identify the plots best suited to sensitive crops (peas, beans).
-## 🎯 Objectives
-- **Predict weed pressure** on each plot for the next 3 growing seasons
-- **Identify low-risk plots** suited to sensitive crops
-- **Analyze the impact of crop rotations** on weed pressure
-- **Propose alternatives** if certain herbicide molecules are withdrawn
-## 📊 Data
-### Data source
-- **Station Expérimentale de Kerguéhennec** (Brittany, France)
-- **Period**: 2014-2024 (10 years)
-- **Volume**: 4,663 intervention records
-- **Coverage**: 100 plots, 42 crop types
-### Key metrics
-- **Mean IFT**: 1.93 (moderate pressure)
-- **Herbicide applications**: 800+ treatments analyzed
-- **Trend**: IFT decreased from 2.91 (2014) to 1.74 (2024)
-## 🔧 Features
-### 1. Weed Pressure Analysis
-- Calculation of the IFT (Indice de Fréquence de Traitement, Treatment Frequency Index)
-- Interactive trend visualizations
-- Risk classification (low/medium/high)
-### 2. Machine Learning Predictions
-- Random Forest model to predict IFT for 2025-2027
-- R² Score: 0.65-0.85
-- Automatic identification of suitable plots
-### 3. Rotation Analysis
-- Impact of crop sequences on weed pressure
-- Identification of the best rotations
-- Optimization recommendations
-### 4. Interactive Interface
-- 6 specialized analysis tabs
-- Real-time filtering
-- Interactive Plotly visualizations
-- Export of results
-## 🚀 Usage
-### Web Interface
-1. Select the tab that matches your analysis
-2. Configure the filters (years, plots, crops)
-3. Run the analysis to get the results
-4. Explore the interactive visualizations
-### Available tabs
-- **📊 Aperçu**: Data overview
-- **🔍 Filtrage**: Interactive exploration
-- **🌿 Pression Adventices**: IFT analysis
-- **🔮 Prédictions**: ML predictive model
-- **🔄 Rotations**: Rotation impact
-- **💊 Herbicides**: Product analysis
-## 🧮 Méthodologie
-### Calcul de l'IFT
-```
-IFT = Nombre d'applications / Surface de la parcelle
-```
-### Seuils d'interprétation
-- **IFT < 1.0**: Pression faible (adapté cultures sensibles)
-- **IFT 1.0-2.0**: Pression modérée (surveillance nécessaire)
-- **IFT > 2.0**: Pression élevée (intervention requise)
-### Modèle Prédictif
-- **Algorithme**: Random Forest Regressor
-- **Variables**: Année, surface, IFT historique, culture, rotation
-- **Validation**: Division temporelle des données
-## 📈 Key Results
-### Best Rotations
-1. **Peas → Rapeseed**: IFT 0.62 (excellent)
-2. **Barley → Rapeseed**: IFT 0.64 (very good)
-3. **Maize → Barley**: IFT 0.69 (good)
-### Main Herbicides
-1. **BISCOTO** (wheat): 21 applications
-2. **CALLISTO** (maize): 20 applications
-3. **PRIMUS** (wheat): 20 applications
-### Recommended Plots (IFT < 1.0)
-Automatic identification of the plots best suited to sensitive crops for 2025-2027.
-## 🌍 Environmental Impact
-- **Herbicide reduction**: Targeted, data-driven application
-- **Biodiversity protection**: Lower chemical pressure
-- **Soil health**: Optimized rotations
-- **Water quality**: Reduced runoff
-## 🏆 Technical Architecture
-### Components
-- **MCP server**: Model Context Protocol for LLM integration
-- **Gradio interface**: Interactive web application
-- **Analysis engine**: Machine learning and statistics
-- **HF integration**: Deployment and data sharing
-### Performance
-- **Data loading**: < 5 seconds
-- **Full analysis**: < 10 seconds
-- **Chart generation**: < 3 seconds
-- **Interface response**: < 1 second
-## 📚 Documentation
-### User guide
-Each tab contains built-in instructions and usage examples.
-### API and tools
-- 7 analysis tools via the MCP server
-- 6 structured data resources
-- JSON format for integration
-## 🤝 Contribution
-Developed for the CRA hackathon to help Breton farmers optimize their crop-protection practices.
-### Team
-- Agricultural data analysis
-- Development of decision-support tools
-- Intuitive user interface
-## 📞 Support
-For technical questions or suggestions for improvement, use the discussion features of the Hugging Face Space.
----
-**Developed with ❤️ for Breton agriculture and pesticide reduction**
-## 🔗 Useful Links
-- [Full documentation](README.md)
-- [Source code](https://huggingface.co/spaces/USERNAME/agricultural-analysis/tree/main)
-- [Dataset used](https://huggingface.co/datasets/HackathonCRA/2024)
-- [Methodology guide](IMPLEMENTATION_SUMMARY.md)

+# 🚜 Hackathon CRA - Weed Pressure Analysis
+## 🎯 Objective
+MCP (Model Context Protocol) server to anticipate and reduce weed pressure in Breton agricultural plots, based on analysis of historical data from the Station Expérimentale de Kerguéhennec (2014-2024).
+## 🔍 Features
+### 📈 IFT Trend Analysis
+- Computation of the herbicide Treatment Frequency Index (IFT, Indice de Fréquence de Traitement)
+- Change over time by plot and by crop
+- Filtering by period and plot
+### 🔮 2025-2027 Predictions
+- Predictive model based on historical trends
+- Risk classification (Low/Moderate/High)
+- Interactive visualizations
+### 🌱 Sensitive Crop Recommendations
+- Identification of plots suitable for peas and beans
+- Recommendation score based on predicted IFT
+- Optimized selection criteria
+### 🔄 Technical Alternatives
+- Proposed mechanical, cultural, and biological alternatives
+- Action plans for herbicide reduction
+- Documentation of best practices
+## ⚙️ Installation
+```bash
+# Clone the project
+git clone <repo-url>
+cd mcp
+# Install dependencies
+pip install -r requirements.txt
+# Hugging Face configuration (optional)
+export HF_TOKEN="your_hf_token"
+export DATASET_ID="HackathonCRA/2024"
+```
+## 🚀 Launch
+### Local
+```bash
+python mcp_server.py
+```
+### Hugging Face Spaces
+```bash
+python app.py
+```
+The MCP server will be available at `http://localhost:7860`
+## 📊 Data Structure
+The data comes from the Station Expérimentale de Kerguéhennec and includes:
+- **Temporal variables**: crop year (millésime), intervention dates
+- **Spatial variables**: plots, areas
+- **Crop variables**: crop types, rotations
+- **Technical variables**: products used, quantities, IFT
+## 🤖 MCP Architecture
+The server exposes analysis tools over MCP:
+1. **analyze_herbicide_trends**: IFT trend analysis
+2. **predict_future_weed_pressure**: 2025-2027 predictions
+3. **recommend_sensitive_crop_plots**: Plot recommendations
+4. **generate_technical_alternatives**: Technical alternatives
+## 📈 Analysis Methods
+### Herbicide IFT Calculation
+```
+IFT = Number of applications / Plot area
+```
+### Weed Pressure Prediction
+- Linear regression on historical data
+- Classification into risk levels
+- Extrapolation to 2025-2027
+### Recommendation Score
+```
+Score = 100 - (predicted_IFT × 30)
+```
+## 🛠️ Technologies
+- **Gradio**: User interface and MCP server
+- **Pandas/Numpy**: Data processing
+- **Plotly**: Interactive visualizations
+- **Hugging Face**: Hosting and datasets
+- **Python 3.8+**: Main language
+## 📝 License
+Project developed as part of the Hackathon CRA Bretagne 2024.
+## 🤝 Contact
+- **Team**: Hackathon CRA Bretagne
+- **Data**: Station Expérimentale de Kerguéhennec
+- **Support**: GitHub Issues
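
Review note on the new README: the prediction, risk classification, and recommendation score it describes can be sketched in a few lines. This is an illustrative sketch only, not the code in `mcp_server.py`. The function names are hypothetical, the 1.0 and 2.0 risk thresholds are taken from the previous README, the English risk labels are renderings of the README's three levels, and clamping the prediction at 0 and the score to the 0-100 range are assumptions.

```python
from typing import Sequence, Tuple


def fit_line(years: Sequence[float], ifts: Sequence[float]) -> Tuple[float, float]:
    """Ordinary least-squares fit of ift = slope * year + intercept."""
    n = len(years)
    mean_x = sum(years) / n
    mean_y = sum(ifts) / n
    sxx = sum((x - mean_x) ** 2 for x in years)
    if sxx == 0:
        # A single year of history: fall back to a flat line at the mean.
        return 0.0, mean_y
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(years, ifts))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def predict_ift(years: Sequence[float], ifts: Sequence[float], target_year: int) -> float:
    """Extrapolate the fitted line to target_year (assumption: floor at 0)."""
    slope, intercept = fit_line(years, ifts)
    return max(0.0, slope * target_year + intercept)


def risk_level(ift: float) -> str:
    """Thresholds from the previous README: < 1.0 low, 1.0-2.0 moderate, > 2.0 high."""
    if ift < 1.0:
        return "Low"
    if ift <= 2.0:
        return "Moderate"
    return "High"


def recommendation_score(predicted_ift: float) -> float:
    """README formula Score = 100 - (predicted IFT x 30); clamping is an assumption."""
    return max(0.0, min(100.0, 100.0 - predicted_ift * 30.0))
```

Under this formula, a plot at the sensitive-crop threshold (predicted IFT of 1.0) scores 70, and the score bottoms out once the predicted IFT exceeds roughly 3.3.
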

analysis_tools.py DELETED Viewed

@@ -1,368 +0,0 @@
-"""
-Analysis tools for agricultural data.
-Provides statistical analysis and visualization capabilities.
-"""
-import pandas as pd
-import numpy as np
-import matplotlib.pyplot as plt
-import seaborn as sns
-import plotly.express as px
-import plotly.graph_objects as go
-from plotly.subplots import make_subplots
-from sklearn.ensemble import RandomForestRegressor
-from sklearn.model_selection import train_test_split
-from sklearn.metrics import mean_squared_error, r2_score
-from typing import List, Dict, Optional, Tuple, Any
-import warnings
-warnings.filterwarnings('ignore')
-class AgriculturalAnalyzer:
-    """Provides analysis tools for agricultural intervention data."""
-    def __init__(self, data_loader):
-        self.data_loader = data_loader
-        self.prediction_models = {}
-    def analyze_weed_pressure_trends(self,
-                                   years: Optional[List[int]] = None,
-                                   plots: Optional[List[str]] = None) -> Dict[str, Any]:
-        """Analyze weed pressure trends based on herbicide usage."""
-        herbicide_data = self.data_loader.get_herbicide_usage(years=years)
-        if plots:
-            herbicide_data = herbicide_data[herbicide_data['plot_name'].isin(plots)]
-        # Calculate trends
-        trends = {}
-        # Overall IFT trend by year
-        yearly_ift = herbicide_data.groupby('year')['ift_herbicide'].mean().reset_index()
-        trends['yearly_ift'] = yearly_ift
-        # IFT trend by plot
-        plot_ift = herbicide_data.groupby(['plot_name', 'year'])['ift_herbicide'].mean().reset_index()
-        trends['plot_ift'] = plot_ift
-        # IFT trend by crop type
-        crop_ift = herbicide_data.groupby(['crop_type', 'year'])['ift_herbicide'].mean().reset_index()
-        trends['crop_ift'] = crop_ift
-        # Statistical summary
-        summary_stats = {
-            'mean_ift': herbicide_data['ift_herbicide'].mean(),
-            'std_ift': herbicide_data['ift_herbicide'].std(),
-            'min_ift': herbicide_data['ift_herbicide'].min(),
-            'max_ift': herbicide_data['ift_herbicide'].max(),
-            'total_applications': herbicide_data['num_applications'].sum(),
-            'unique_plots': herbicide_data['plot_name'].nunique(),
-            'unique_crops': herbicide_data['crop_type'].nunique()
-        }
-        trends['summary'] = summary_stats
-        return trends
-    def create_weed_pressure_visualization(self,
-                                         years: Optional[List[int]] = None,
-                                         plots: Optional[List[str]] = None) -> go.Figure:
-        """Create interactive visualization of weed pressure trends."""
-        trends = self.analyze_weed_pressure_trends(years=years, plots=plots)
-        # Create subplots
-        fig = make_subplots(
-            rows=2, cols=2,
-            subplot_titles=('IFT Evolution par Année', 'IFT par Parcelle',
-                          'IFT par Type de Culture', 'Distribution IFT'),
-            specs=[[{"secondary_y": False}, {"secondary_y": False}],
-                   [{"secondary_y": False}, {"secondary_y": False}]]
-        )
-        # Plot 1: Yearly IFT trend
-        yearly_data = trends['yearly_ift']
-        fig.add_trace(
-            go.Scatter(x=yearly_data['year'], y=yearly_data['ift_herbicide'],
-                      mode='lines+markers', name='IFT Moyen',
-                      line=dict(color='blue')),
-            row=1, col=1
-        )
-        # Plot 2: IFT by plot
-        plot_data = trends['plot_ift']
-        for plot in plot_data['plot_name'].unique():
-            plot_subset = plot_data[plot_data['plot_name'] == plot]
-            fig.add_trace(
-                go.Scatter(x=plot_subset['year'], y=plot_subset['ift_herbicide'],
-                          mode='lines+markers', name=f'Parcelle {plot}',
-                          showlegend=False),
-                row=1, col=2
-            )
-        # Plot 3: IFT by crop
-        crop_data = trends['crop_ift']
-        for crop in crop_data['crop_type'].unique()[:5]:  # Limit to top 5 crops
-            crop_subset = crop_data[crop_data['crop_type'] == crop]
-            fig.add_trace(
-                go.Scatter(x=crop_subset['year'], y=crop_subset['ift_herbicide'],
-                          mode='lines+markers', name=crop,
-                          showlegend=False),
-                row=2, col=1
-            )
-        # Plot 4: IFT distribution
-        herbicide_data = self.data_loader.get_herbicide_usage(years=years)
-        if plots:
-            herbicide_data = herbicide_data[herbicide_data['plot_name'].isin(plots)]
-        fig.add_trace(
-            go.Histogram(x=herbicide_data['ift_herbicide'],
-                        name='Distribution IFT',
-                        showlegend=False),
-            row=2, col=2
-        )
-        # Update layout
-        fig.update_layout(
-            title_text="Analyse de la Pression Adventices (IFT Herbicides)",
-            height=800,
-            showlegend=True
-        )
-        # Update axes labels
-        fig.update_xaxes(title_text="Année", row=1, col=1)
-        fig.update_yaxes(title_text="IFT Herbicide", row=1, col=1)
-        fig.update_xaxes(title_text="Année", row=1, col=2)
-        fig.update_yaxes(title_text="IFT Herbicide", row=1, col=2)
-        fig.update_xaxes(title_text="Année", row=2, col=1)
-        fig.update_yaxes(title_text="IFT Herbicide", row=2, col=1)
-        fig.update_xaxes(title_text="IFT Herbicide", row=2, col=2)
-        fig.update_yaxes(title_text="Fréquence", row=2, col=2)
-        return fig
-    def analyze_crop_rotation_impact(self) -> pd.DataFrame:
-        """Analyze the impact of crop rotation on weed pressure."""
-        df = self.data_loader.load_all_files()
-        # Group by plot and year to get crop sequences
-        plot_years = df.groupby(['plot_name', 'year'])['crop_type'].first().reset_index()
-        plot_years = plot_years.sort_values(['plot_name', 'year'])
-        # Create rotation sequences
-        rotations = []
-        for plot in plot_years['plot_name'].unique():
-            plot_data = plot_years[plot_years['plot_name'] == plot].sort_values('year')
-            crops = plot_data['crop_type'].tolist()
-            years = plot_data['year'].tolist()
-            for i in range(len(crops)-1):
-                rotations.append({
-                    'plot_name': plot,
-                    'year_from': years[i],
-                    'year_to': years[i+1],
-                    'crop_from': crops[i],
-                    'crop_to': crops[i+1],
-                    'rotation_type': f"{crops[i]} → {crops[i+1]}"
-                })
-        rotation_df = pd.DataFrame(rotations)
-        # Get herbicide usage for each rotation
-        herbicide_data = self.data_loader.get_herbicide_usage()
-        # Merge with rotation data
-        rotation_analysis = rotation_df.merge(
-            herbicide_data[['plot_name', 'year', 'ift_herbicide']],
-            left_on=['plot_name', 'year_to'],
-            right_on=['plot_name', 'year'],
-            how='left'
-        )
-        # Analyze rotation impact
-        rotation_impact = rotation_analysis.groupby('rotation_type').agg({
-            'ift_herbicide': ['mean', 'std', 'count']
-        }).round(3)
-        rotation_impact.columns = ['mean_ift', 'std_ift', 'count']
-        rotation_impact = rotation_impact.reset_index()
-        rotation_impact = rotation_impact[rotation_impact['count'] >= 2]  # At least 2 observations
-        rotation_impact = rotation_impact.sort_values('mean_ift')
-        return rotation_impact
-    def predict_weed_pressure(self,
-                            target_years: List[int] = [2025, 2026, 2027],
-                            plots: Optional[List[str]] = None) -> Dict[str, Any]:
-        """Predict weed pressure for the next 3 years."""
-        # Prepare training data
-        df = self.data_loader.load_all_files()
-        herbicide_data = self.data_loader.get_herbicide_usage()
-        # Create features for prediction
-        features_df = []
-        for plot in herbicide_data['plot_name'].unique():
-            if plots and plot not in plots:
-                continue
-            plot_data = herbicide_data[herbicide_data['plot_name'] == plot].sort_values('year')
-            for i in range(len(plot_data)):
-                row = plot_data.iloc[i].copy()
-                # Add historical features
-                if i > 0:
-                    row['prev_ift'] = plot_data.iloc[i-1]['ift_herbicide']
-                    row['prev_crop'] = plot_data.iloc[i-1]['crop_type']
-                else:
-                    row['prev_ift'] = 0
-                    row['prev_crop'] = 'unknown'
-                # Add trend features
-                if i >= 2:
-                    recent_years = plot_data.iloc[i-2:i+1]
-                    row['ift_trend'] = np.polyfit(range(3), recent_years['ift_herbicide'], 1)[0]
-                else:
-                    row['ift_trend'] = 0
-                features_df.append(row)
-        features_df = pd.DataFrame(features_df)
-        # Prepare features for ML model
-        # Encode categorical variables
-        crop_dummies = pd.get_dummies(features_df['crop_type'], prefix='crop')
-        prev_crop_dummies = pd.get_dummies(features_df['prev_crop'], prefix='prev_crop')
-        plot_dummies = pd.get_dummies(features_df['plot_name'], prefix='plot')
-        X = pd.concat([
-            features_df[['year', 'plot_surface', 'prev_ift', 'ift_trend']],
-            crop_dummies,
-            prev_crop_dummies,
-            plot_dummies
-        ], axis=1)
-        y = features_df['ift_herbicide']
-        # Remove rows with missing values
-        mask = ~(X.isnull().any(axis=1) | y.isnull())
-        X = X[mask]
-        y = y[mask]
-        # Train model
-        X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
-        model = RandomForestRegressor(n_estimators=100, random_state=42)
-        model.fit(X_train, y_train)
-        # Evaluate model
-        y_pred = model.predict(X_test)
-        mse = mean_squared_error(y_test, y_pred)
-        r2 = r2_score(y_test, y_pred)
-        # Make predictions for target years
-        predictions = {}
-        for year in target_years:
-            year_predictions = []
-            # Get last known data for each plot
-            plot_columns = [col for col in X.columns if col.startswith('plot_')]
-            unique_plots = [col.replace('plot_', '') for col in plot_columns]
-            for plot in unique_plots:
-                if plots and plot not in plots:
-                    continue
-                # Find last known data for this plot
-                plot_mask = features_df['plot_name'] == plot
-                if not plot_mask.any():
-                    continue
-                last_data = features_df[plot_mask].iloc[-1]
-                # Create prediction features
-                pred_row = pd.Series(index=X.columns, dtype=float)
-                pred_row['year'] = year
-                pred_row['plot_surface'] = last_data['plot_surface']
-                pred_row['prev_ift'] = last_data['ift_herbicide']
-                pred_row['ift_trend'] = last_data.get('ift_trend', 0)
-                # Set plot dummy
-                plot_col = f'plot_{plot}'
-                if plot_col in pred_row.index:
-                    pred_row[plot_col] = 1
-                # Assume same crop as last year for now
-                crop_col = f'crop_{last_data["crop_type"]}'
-                if crop_col in pred_row.index:
-                    pred_row[crop_col] = 1
-                prev_crop_col = f'prev_crop_{last_data["crop_type"]}'
-                if prev_crop_col in pred_row.index:
-                    pred_row[prev_crop_col] = 1
-                # Fill missing values with 0
-                pred_row = pred_row.fillna(0)
-                # Make prediction
-                pred_ift = model.predict([pred_row])[0]
-                year_predictions.append({
-                    'plot_name': plot,
-                    'year': year,
-                    'predicted_ift': pred_ift,
-                    'risk_level': 'low' if pred_ift < 1.0 else 'medium' if pred_ift < 2.0 else 'high'
-                })
-            predictions[year] = pd.DataFrame(year_predictions)
-        # Feature importance
-        feature_importance = pd.DataFrame({
-            'feature': X.columns,
-            'importance': model.feature_importances_
-        }).sort_values('importance', ascending=False)
-        return {
-            'predictions': predictions,
-            'model_performance': {'mse': mse, 'r2': r2},
-            'feature_importance': feature_importance
-        }
-    def identify_suitable_plots_for_sensitive_crops(self,
-                                                  target_years: List[int] = [2025, 2026, 2027],
-                                                  max_ift_threshold: float = 1.0) -> Dict[str, List[str]]:
-        """Identify plots suitable for sensitive crops (peas, beans) based on low weed pressure."""
-        predictions = self.predict_weed_pressure(target_years=target_years)
-        suitable_plots = {}
-        for year in target_years:
-            if year not in predictions['predictions']:
-                continue
-            year_data = predictions['predictions'][year]
-            suitable = year_data[year_data['predicted_ift'] <= max_ift_threshold]
-            suitable_plots[year] = suitable['plot_name'].tolist()
-        return suitable_plots
-    def analyze_herbicide_alternatives(self) -> pd.DataFrame:
-        """Analyze herbicide usage patterns and suggest alternatives."""
-        df = self.data_loader.load_all_files()
-        herbicides = df[df['is_herbicide'] == True]
-        # Analyze herbicide usage by product
-        herbicide_usage = herbicides.groupby(['produit', 'crop_type']).agg({
-            'quantitetot': ['sum', 'mean', 'count'],
-            'codeamm': 'first'
-        }).round(3)
-        herbicide_usage.columns = ['total_quantity', 'avg_quantity', 'applications', 'amm_code']
-        herbicide_usage = herbicide_usage.reset_index()
-        herbicide_usage = herbicide_usage.sort_values('applications', ascending=False)
-        # Identify most used herbicides
-        top_herbicides = herbicide_usage.head(20)
-        return top_herbicides
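
Review note: the previous README described validation as a temporal split, whereas `predict_weed_pressure` above calls `train_test_split` with a random 80/20 split, so test rows can predate training rows. A year-based hold-out could look roughly like the sketch below. It assumes each record carries a `year` field; the helper name `temporal_split` and its `holdout_years` parameter are hypothetical and not part of this repository.

```python
from typing import Any, Dict, List, Tuple

Row = Dict[str, Any]


def temporal_split(rows: List[Row], holdout_years: int = 2) -> Tuple[List[Row], List[Row]]:
    """Train on the earlier years and test on the most recent holdout_years years."""
    if holdout_years < 1:
        raise ValueError("holdout_years must be at least 1")
    years = sorted({row["year"] for row in rows})
    if len(years) <= holdout_years:
        raise ValueError("not enough distinct years to hold any out")
    cutoff = years[-holdout_years]  # first year that belongs to the test set
    train = [row for row in rows if row["year"] < cutoff]
    test = [row for row in rows if row["year"] >= cutoff]
    return train, test
```

With data covering 2014-2024 and `holdout_years=2`, the model would be fitted on 2014-2022 and scored on 2023-2024, which is closer to how the 2025-2027 forecasts are actually used.
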

app.py CHANGED Viewed

@@ -1,230 +1,19 @@
-import os
-import gradio as gr
-# Import your existing Gradio app and analysis tools
-from gradio_app import create_gradio_app
-from data_loader import AgriculturalDataLoader
-from analysis_tools import AgriculturalAnalyzer
-# --------- Config ---------
-PORT = int(os.environ.get("PORT", 7860))
-# Initialize agricultural components
-data_loader = AgriculturalDataLoader()
-analyzer = AgriculturalAnalyzer(data_loader)
-# --------- Fonctions MCP pour outils agricoles ---------
-@gr.mcp.tool()
-def analyze_weed_pressure(years: str = "", plots: str = "") -> str:
-    """Analyze weed pressure trends using IFT herbicide data from Kerguéhennec experimental station.
-    Args:
-        years: Comma-separated list of years to analyze (e.g., "2020,2021,2022"). Leave empty for all years.
-        plots: Comma-separated list of plot names to analyze (e.g., "P1,P2,P3"). Leave empty for all plots.
-    Returns:
-        Detailed analysis of weed pressure with IFT statistics and interpretation.
-    """
-    try:
-        # Parse parameters
-        year_list = [int(y.strip()) for y in years.split(",")] if years.strip() else None
-        plot_list = [p.strip() for p in plots.split(",")] if plots.strip() else None
-        trends = analyzer.analyze_weed_pressure_trends(years=year_list, plots=plot_list)
-        summary_stats = trends['summary']
-        result = f"""🌿 ANALYSE DE LA PRESSION ADVENTICES (IFT Herbicides)
-📊 Statistiques pour les années {years or 'toutes'} et parcelles {plots or 'toutes'}:
-• IFT moyen: {summary_stats['mean_ift']:.2f}
-• Écart-type: {summary_stats['std_ift']:.2f}
-• IFT minimum: {summary_stats['min_ift']:.2f}
-• IFT maximum: {summary_stats['max_ift']:.2f}
-• Total applications: {summary_stats['total_applications']}
-• Parcelles analysées: {summary_stats['unique_plots']}
-• Cultures analysées: {summary_stats['unique_crops']}
-💡 Interprétation:
-• IFT < 1.0: Pression faible (adapté aux cultures sensibles)
-• IFT 1.0-2.0: Pression modérée
-• IFT > 2.0: Pression élevée"""
-        return result
-    except Exception as e:
-        return f"❌ Erreur lors de l'analyse: {str(e)}"
-@gr.mcp.tool()
-def predict_future_pressure(target_years: str = "2025,2026,2027", max_ift: float = 1.0) -> str:
-    """Predict future weed pressure and identify suitable plots for sensitive crops.
-    Args:
-        target_years: Comma-separated list of years to predict (e.g., "2025,2026,2027")
-        max_ift: Maximum IFT threshold for sensitive crops (default: 1.0)
-    Returns:
-        Predictions for each year with suitable plots for sensitive crops.
-    """
-    try:
-        year_list = [int(y.strip()) for y in target_years.split(",")]
-        predictions = analyzer.predict_weed_pressure(target_years=year_list)
-        model_perf = predictions['model_performance']
-        result = f"""🔮 PRÉDICTION DE LA PRESSION ADVENTICES
-🤖 Performance du modèle:
-• R² Score: {model_perf['r2']:.3f}
-• Erreur quadratique moyenne: {model_perf['mse']:.3f}
-📈 Prédictions par année:
 """
-        for year in year_list:
-            if year in predictions['predictions']:
-                year_pred = predictions['predictions'][year]
-                result += f"\n📅 {year}:\n"
-                for _, row in year_pred.iterrows():
-                    result += f"• {row['plot_name']}: IFT {row['predicted_ift']:.2f} (Risque: {row['risk_level']})\n"
-        suitable_plots = analyzer.identify_suitable_plots_for_sensitive_crops(
-            target_years=year_list, max_ift_threshold=max_ift
-        )
-        result += f"\n🌱 Parcelles adaptées aux cultures sensibles (IFT < {max_ift}):\n"
-        for year, plots in suitable_plots.items():
-            if plots:
-                result += f"• {year}: {', '.join(plots)}\n"
-            else:
-                result += f"• {year}: Aucune parcelle adaptée\n"
-        return result
-    except Exception as e:
-        return f"❌ Erreur lors de la prédiction: {str(e)}"
-@gr.mcp.tool()
-def analyze_crop_rotation() -> str:
-    """Analyze the impact of crop rotations on weed pressure at Kerguéhennec station.
-    Returns:
-        Analysis of the best crop rotations with lowest average IFT herbicide usage.
-    """
-    try:
-        rotation_impact = analyzer.analyze_crop_rotation_impact()
-        if rotation_impact.empty:
-            return "📊 Pas assez de données pour analyser les rotations"
-        result = "🔄 IMPACT DES ROTATIONS CULTURALES\n\n🏆 Meilleures rotations (IFT moyen le plus bas):\n\n"
-        best_rotations = rotation_impact.head(10)
-        for i, (_, row) in enumerate(best_rotations.iterrows(), 1):
-            result += f"{i}. **{row['rotation_type']}**\n"
-            result += f"   • IFT moyen: {row['mean_ift']:.2f}\n"
-            result += f"   • Écart-type: {row['std_ift']:.2f}\n"
-            result += f"   • Observations: {row['count']}\n\n"
-        result += "💡 Les rotations avec les IFT les plus bas sont généralement plus durables."
-        return result
-    except Exception as e:
-        return f"❌ Erreur lors de l'analyse des rotations: {str(e)}"
-@gr.mcp.tool()
-def get_dataset_summary() -> str:
-    """Get a comprehensive summary of the agricultural dataset from Kerguéhennec experimental station.
-    Returns:
-        Complete summary with statistics, top crops, top plots and data coverage.
-    """
-    try:
-        df = data_loader.load_all_files()
-        if df.empty:
-            return "❌ Aucune donnée disponible"
-        summary = f"""📊 RÉSUMÉ DU DATASET AGRICOLE - STATION DE KERGUÉHENNEC
-📈 Statistiques générales:
-• Total d'enregistrements: {len(df):,}
-• Parcelles uniques: {df['plot_name'].nunique()}
-• Types de cultures: {df['crop_type'].nunique()}
-• Années couvertes: {', '.join(map(str, sorted(df['year'].unique())))}
-• Applications herbicides: {len(df[df['is_herbicide'] == True]):,}
-🌱 Top 5 des cultures:
-{df['crop_type'].value_counts().head(5).to_string()}
-📍 Top 5 des parcelles:
-{df['plot_name'].value_counts().head(5).to_string()}
-🏢 Source: Station Expérimentale de Kerguéhennec"""
-        return summary
-    except Exception as e:
-        return f"❌ Erreur lors du chargement des données: {str(e)}"
-@gr.mcp.resource("agricultural://dataset/summary")
-def dataset_resource() -> str:
-    """Agricultural dataset summary resource for Kerguéhennec experimental station."""
-    return get_dataset_summary()
-@gr.mcp.prompt()
-def agricultural_analysis_prompt(analysis_type: str = "general", focus: str = "sustainability") -> str:
-    """Generate analysis prompts for agricultural data interpretation.
-    Args:
-        analysis_type: Type of analysis (general, weed_pressure, rotation, prediction)
-        focus: Focus area (sustainability, productivity, reduction)
-    Returns:
-        Customized prompt for agricultural analysis.
-    """
-    prompts = {
-        "general": "Analyze the agricultural data to provide insights on farming practices and sustainability",
-        "weed_pressure": "Focus on weed pressure analysis and herbicide usage patterns",
-        "rotation": "Examine crop rotation strategies and their impact on weed management",
-        "prediction": "Predict future agricultural trends and provide recommendations"
-    }
-    focus_additions = {
-        "sustainability": "with emphasis on sustainable and eco-friendly practices",
-        "productivity": "focusing on maximizing crop productivity and yield",
-        "reduction": "prioritizing herbicide reduction and organic alternatives"
-    }
-    base_prompt = prompts.get(analysis_type, prompts["general"])
-    focus_addition = focus_additions.get(focus, focus_additions["sustainability"])
-    return f"{base_prompt} {focus_addition}. Consider IFT values, crop rotations, and environmental impact in your analysis."
-# --------- Interface Gradio principale ---------
-demo = create_gradio_app()
-# --------- Lancement avec serveur MCP intégré ---------
 if __name__ == "__main__":
     demo.launch(
-        mcp_server=True,  # Active le serveur MCP intégré
-        server_name="0.0.0.0",
-        server_port=PORT,
-        share=False
-    )
-# ========= Configuration MCP pour clients =========
-# L'endpoint MCP sera disponible à : https://hackathoncra-mcp.hf.space/gradio_api/mcp/sse
-#
-# Configuration pour MCP Inspector ou autres clients:
-# {
-#   "mcpServers": {
-#     "agricultural-analysis": {
-#       "url": "https://hackathoncra-mcp.hf.space/gradio_api/mcp/sse"
-#     }
-#   }
-# }
-#
-# Pour Claude Desktop (avec mcp-remote):
-# {
-#   "mcpServers": {
-#     "agricultural-analysis": {
-#       "command": "npx",
-#       "args": [
-#         "mcp-remote",
-#         "https://hackathoncra-mcp.hf.space/gradio_api/mcp/sse"
-#       ]
-#     }
-#   }
-# }

 """
+Main application launcher for Hugging Face deployment
+"""
+import os
+from mcp_server import create_mcp_interface
+# Hugging Face configuration
+# HF_TOKEN is read from the environment when set; setdefault() with a None value would raise TypeError.
+os.environ.setdefault("DATASET_ID", "HackathonCRA/2024")
 if __name__ == "__main__":
+    demo = create_mcp_interface()
     demo.launch(
+        mcp_server=True,
+        server_name="0.0.0.0",
+        server_port=7860,
+        share=True
+    )
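
Review note on environment defaults in the launcher: `os.environ.setdefault` only accepts string values, so a default read from another lookup that may be unset (for example `os.environ.get("HF_TOKEN")`) has to be checked for `None` before it is passed. A minimal sketch follows; `apply_defaults` is a hypothetical helper, and it takes the mapping as a parameter so it can be exercised without touching the real environment.

```python
from typing import Mapping, MutableMapping, Optional


def apply_defaults(env: MutableMapping[str, str],
                   defaults: Mapping[str, Optional[str]]) -> None:
    """Set each default only when it is a real string and the key is not already set."""
    for key, value in defaults.items():
        if value is None:
            # Nothing to fall back to: leave the key unset instead of raising.
            continue
        env.setdefault(key, value)
```

In the launcher this would be called as `apply_defaults(os.environ, {"DATASET_ID": "HackathonCRA/2024"})`; values already exported in the Space settings are left untouched.
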

demo.py DELETED Viewed

@@ -1,218 +0,0 @@
-#!/usr/bin/env python3
-"""
-Demo script for the Agricultural Analysis Tool
-Showcases the main features and functionality of the MCP server and analysis tools.
-"""
-import warnings
-warnings.filterwarnings('ignore')
-from data_loader import AgriculturalDataLoader
-from analysis_tools import AgriculturalAnalyzer
-import pandas as pd
-def main():
-    """Run the demo of agricultural analysis features."""
-    print("🚜" + "="*60)
-    print("    AGRICULTURAL ANALYSIS TOOL - DEMO")
-    print("    Station Expérimentale de Kerguéhennec")
-    print("="*63)
-    print()
-    # Initialize components
-    print("🔧 Initializing components...")
-    data_loader = AgriculturalDataLoader()
-    analyzer = AgriculturalAnalyzer(data_loader)
-    print("✅ Components initialized successfully")
-    print()
-    # Load data
-    print("📊 Loading agricultural intervention data...")
-    df = data_loader.load_all_files()
-    print(f"✅ Loaded {len(df):,} intervention records")
-    print(f"📅 Data spans {df.year.nunique()} years: {sorted(df.year.unique())}")
-    print(f"🌱 Covers {df.crop_type.nunique()} different crop types")
-    print(f"📍 Across {df.plot_name.nunique()} different plots")
-    print(f"💊 Including {df.is_herbicide.sum():,} herbicide applications")
-    print()
-    # Show top crops and plots
-    print("🌾 TOP CROPS ANALYZED:")
-    top_crops = df.crop_type.value_counts().head(10)
-    for i, (crop, count) in enumerate(top_crops.items(), 1):
-        print(f"   {i:2}. {crop:<30} ({count:3} interventions)")
-    print()
-    print("📍 TOP PLOTS ANALYZED:")
-    top_plots = df.plot_name.value_counts().head(10)
-    for i, (plot, count) in enumerate(top_plots.items(), 1):
-        print(f"   {i:2}. {plot:<30} ({count:3} interventions)")
-    print()
-    # Analyze weed pressure
-    print("🌿 WEED PRESSURE ANALYSIS (IFT - Treatment Frequency Index)")
-    print("-" * 60)
-    trends = analyzer.analyze_weed_pressure_trends()
-    summary = trends['summary']
-    print(f"📈 Overall IFT Statistics:")
-    print(f"   • Mean IFT:           {summary['mean_ift']:.2f}")
-    print(f"   • Standard deviation: {summary['std_ift']:.2f}")
-    print(f"   • Minimum IFT:        {summary['min_ift']:.2f}")
-    print(f"   • Maximum IFT:        {summary['max_ift']:.2f}")
-    print()
-    # Show IFT trends by year
-    if 'yearly_ift' in trends:
-        yearly_data = pd.DataFrame(trends['yearly_ift'])
-        print("📊 IFT Evolution by Year:")
-        for _, row in yearly_data.iterrows():
-            year = int(row['year'])
-            ift = row['ift_herbicide']
-            risk_indicator = "🟢" if ift < 1.0 else "🟡" if ift < 2.0 else "🔴"
-            print(f"   {year}: {ift:.2f} {risk_indicator}")
-        print()
-    # Prediction demo
-    print("🔮 WEED PRESSURE PREDICTIONS (2025-2027)")
-    print("-" * 60)
-    try:
-        predictions = analyzer.predict_weed_pressure(target_years=[2025, 2026, 2027])
-        model_perf = predictions['model_performance']
-        print(f"🤖 Model Performance:")
-        print(f"   • R² Score: {model_perf['r2']:.3f}")
-        print(f"   • Mean Squared Error: {model_perf['mse']:.3f}")
-        print()
-        # Show predictions for each year
-        for year in [2025, 2026, 2027]:
-            if year in predictions['predictions']:
-                year_pred = predictions['predictions'][year]
-                print(f"📅 Predictions for {year}:")
-                # Group by risk level
-                risk_counts = year_pred['risk_level'].value_counts()
-                for risk_level in ['low', 'medium', 'high']:
-                    count = risk_counts.get(risk_level, 0)
-                    emoji = {"low": "🟢", "medium": "🟡", "high": "🔴"}[risk_level]
-                    print(f"   {emoji} {risk_level.capitalize()} risk: {count} plots")
-                # Show a few examples
-                low_risk = year_pred[year_pred['risk_level'] == 'low']
-                if len(low_risk) > 0:
-                    print("   🌱 Best plots for sensitive crops:")
-                    for _, row in low_risk.head(5).iterrows():
-                        print(f"      • {row['plot_name']}: IFT {row['predicted_ift']:.2f}")
-                print()
-    except Exception as e:
-        print(f"❌ Prediction error: {e}")
-        print()
-    # Suitable plots for sensitive crops
-    print("🎯 PLOTS SUITABLE FOR SENSITIVE CROPS (peas, beans)")
-    print("-" * 60)
-    try:
-        suitable_plots = analyzer.identify_suitable_plots_for_sensitive_crops(
-            target_years=[2025, 2026, 2027],
-            max_ift_threshold=1.0
-        )
-        for year, plots in suitable_plots.items():
-            print(f"📅 {year}: {len(plots)} suitable plots")
-            if plots:
-                for plot in plots[:5]:  # Show first 5
-                    print(f"   ✅ {plot}")
-                if len(plots) > 5:
-                    print(f"   ... and {len(plots) - 5} more")
-            else:
-                print("   ❌ No plots meet the criteria")
-            print()
-    except Exception as e:
-        print(f"❌ Analysis error: {e}")
-        print()
-    # Crop rotation analysis
-    print("🔄 CROP ROTATION IMPACT ANALYSIS")
-    print("-" * 60)
-    try:
-        rotation_impact = analyzer.analyze_crop_rotation_impact()
-        if not rotation_impact.empty:
-            print("🏆 Best rotations (lowest average IFT):")
-            best_rotations = rotation_impact.head(10)
-            for i, (_, row) in enumerate(best_rotations.iterrows(), 1):
-                print(f"   {i:2}. {row['rotation_type']:<40} IFT: {row['mean_ift']:.2f}")
-            print()
-            print("⚠️  Worst rotations (highest average IFT):")
-            worst_rotations = rotation_impact.tail(5)
-            for i, (_, row) in enumerate(worst_rotations.iterrows(), 1):
-                print(f"   {i:2}. {row['rotation_type']:<40} IFT: {row['mean_ift']:.2f}")
-        else:
-            print("❌ Insufficient data for rotation analysis")
-        print()
-    except Exception as e:
-        print(f"❌ Rotation analysis error: {e}")
-        print()
-    # Herbicide usage analysis
-    print("💊 HERBICIDE USAGE ANALYSIS")
-    print("-" * 60)
-    try:
-        herbicide_analysis = analyzer.analyze_herbicide_alternatives()
-        print("📈 Most frequently used herbicides:")
-        top_herbicides = herbicide_analysis.head(10)
-        for i, (_, row) in enumerate(top_herbicides.iterrows(), 1):
-            crop_info = f" ({row['crop_type']})" if pd.notna(row['crop_type']) else ""
-            print(f"   {i:2}. {row['produit']:<30}{crop_info}")
-            print(f"       Applications: {row['applications']:<3} | Total qty: {row['total_quantity']:.1f}")
-        print()
-    except Exception as e:
-        print(f"❌ Herbicide analysis error: {e}")
-        print()
-    # Summary and recommendations
-    print("📋 SUMMARY AND RECOMMENDATIONS")
-    print("="*60)
-    print("✅ ACHIEVEMENTS:")
-    print("   • Successfully loaded and analyzed 10 years of intervention data")
-    print("   • Calculated weed pressure trends using IFT methodology")
-    print("   • Developed predictive model for future weed pressure")
-    print("   • Identified suitable plots for sensitive crops")
-    print("   • Analyzed impact of crop rotations")
-    print()
-    print("🎯 KEY INSIGHTS:")
-    avg_ift = summary['mean_ift']
-    if avg_ift < 1.0:
-        print("   • Overall weed pressure is LOW - good for sensitive crops")
-    elif avg_ift < 2.0:
-        print("   • Overall weed pressure is MODERATE - requires monitoring")
-    else:
-        print("   • Overall weed pressure is HIGH - needs intervention")
-    print(f"   • Current average IFT: {avg_ift:.2f}")
-    print(f"   • {df.plot_name.nunique()} plots available for analysis")
-    print(f"   • {df.crop_type.nunique()} different crop types in rotation")
-    print()
-    print("🚀 NEXT STEPS:")
-    print("   • Use the Gradio interface for interactive analysis")
-    print("   • Deploy on Hugging Face Spaces for broader access")
-    print("   • Configure MCP server for LLM integration")
-    print("   • Upload dataset to Hugging Face Hub")
-    print()
-    print("🌐 ACCESS THE TOOL:")
-    print("   • Gradio Interface: python gradio_app.py")
-    print("   • MCP Server: python mcp_server.py")
-    print("   • HF Deployment: python app.py")
-    print()
-    print("🚜" + "="*60)
-    print("    DEMO COMPLETED SUCCESSFULLY!")
-    print("="*63)
-if __name__ == "__main__":
-    main()
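
The demo above applies the same IFT thresholds in two places, the per-year indicator and the key-insights summary: below 1.0 is low, below 2.0 is moderate, anything else is high. A minimal sketch of folding that rule into one helper follows. `classify_ift_risk` and `RISK_EMOJI` are hypothetical names, not functions from the deleted files, and the labels simply mirror the `risk_level` values the demo reads from the predictions.

```python
def classify_ift_risk(ift: float) -> str:
    """Map an IFT value to a risk label using the demo's thresholds."""
    if ift < 1.0:
        return "low"
    if ift < 2.0:
        return "medium"
    return "high"


# Same colour coding the demo prints next to each value.
RISK_EMOJI = {"low": "🟢", "medium": "🟡", "high": "🔴"}


if __name__ == "__main__":
    for value in (0.4, 1.0, 2.5):
        level = classify_ift_risk(value)
        print(f"{value:.2f} {RISK_EMOJI[level]} {level}")
```

Keeping the thresholds in one function means the yearly indicator and the summary cannot drift apart if a cutoff is later changed.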

gradio_app.py DELETED Viewed

@@ -1,474 +0,0 @@
-"""
-Gradio interface for the Agricultural MCP Server.
-Provides a web interface for interacting with agricultural data analysis tools.
-"""
-import gradio as gr
-import json
-import pandas as pd
-import plotly.express as px
-import plotly.graph_objects as go
-from plotly.subplots import make_subplots
-import os
-from data_loader import AgriculturalDataLoader
-from analysis_tools import AgriculturalAnalyzer
-# Initialize components
-# Use Hugging Face dataset exclusively
-data_loader = AgriculturalDataLoader()
-print("🤗 Configured to use Hugging Face dataset exclusively")
-analyzer = AgriculturalAnalyzer(data_loader)
-# Global state for data
-def load_initial_data():
-    """Load the full dataset; return an empty DataFrame if loading fails (no caching is done here)."""
-    try:
-        df = data_loader.load_all_files()
-        return df
-    except Exception as e:
-        print(f"Error loading data: {e}")
-        return pd.DataFrame()
-def get_data_summary():
-    """Get summary of the agricultural data."""
-    try:
-        df = load_initial_data()
-        if df.empty:
-            return "Aucune donnée disponible"
-        summary = f"""
-        ## Résumé des Données Agricoles - Station Expérimentale de Kerguéhennec
-        📊 **Statistiques Générales:**
-        - **Total d'enregistrements:** {len(df):,}
-        - **Parcelles uniques:** {df['plot_name'].nunique()}
-        - **Types de cultures:** {df['crop_type'].nunique()}
-        - **Années couvertes:** {', '.join(map(str, sorted(df['year'].unique())))}
-        - **Applications herbicides:** {int(df['is_herbicide'].sum()):,}
-        🌱 **Cultures principales:**
-        {df['crop_type'].value_counts().head(5).to_string()}
-        📍 **Parcelles principales:**
-        {df['plot_name'].value_counts().head(5).to_string()}
-        """
-        return summary
-    except Exception as e:
-        return f"Erreur lors du chargement des données: {str(e)}"
-def filter_and_analyze_data(years, plots, crops):
-    """Filter data and provide analysis."""
-    try:
-        df = load_initial_data()
-        if df.empty:
-            return "Aucune donnée disponible", None
-        # Convert inputs to lists if not None
-        year_list = [int(y) for y in years] if years else None
-        plot_list = plots if plots else None
-        crop_list = crops if crops else None
-        # Filter data
-        filtered_df = data_loader.filter_data(
-            years=year_list,
-            plots=plot_list,
-            crops=crop_list
-        )
-        if filtered_df.empty:
-            return "Aucune donnée trouvée avec ces filtres", None
-        # Generate analysis
-        analysis = f"""
-        ## Analyse des Données Filtrées
-        **Filtres appliqués:**
-        - Années: {years if years else 'Toutes'}
-        - Parcelles: {', '.join(plots) if plots else 'Toutes'}
-        - Cultures: {', '.join(crops) if crops else 'Toutes'}
-        **Résultats:**
-        - Enregistrements filtrés: {len(filtered_df):,}
-        - Applications herbicides: {int(filtered_df['is_herbicide'].sum()):,}
-        - Parcelles concernées: {filtered_df['plot_name'].nunique()}
-        - Cultures concernées: {filtered_df['crop_type'].nunique()}
-        **Distribution par année:**
-        {filtered_df['year'].value_counts().sort_index().to_string()}
-        """
-        # Create visualization
-        yearly_dist = filtered_df['year'].value_counts().sort_index()
-        fig = px.bar(
-            x=yearly_dist.index,
-            y=yearly_dist.values,
-            title="Distribution des Interventions par Année",
-            labels={'x': 'Année', 'y': 'Nombre d\'Interventions'}
-        )
-        return analysis, fig
-    except Exception as e:
-        return f"Erreur lors de l'analyse: {str(e)}", None
-def analyze_weed_pressure(years, plots):
-    """Analyze weed pressure trends."""
-    try:
-        # Convert inputs
-        year_list = [int(y) for y in years] if years else None
-        plot_list = plots if plots else None
-        # Get analysis
-        trends = analyzer.analyze_weed_pressure_trends(years=year_list, plots=plot_list)
-        # Format results
-        summary_stats = trends['summary']
-        analysis_text = f"""
-        ## Analyse de la Pression Adventices (IFT Herbicides)
-        **Statistiques globales:**
-        - IFT moyen: {summary_stats['mean_ift']:.2f}
-        - Écart-type: {summary_stats['std_ift']:.2f}
-        - IFT minimum: {summary_stats['min_ift']:.2f}
-        - IFT maximum: {summary_stats['max_ift']:.2f}
-        - Total applications: {summary_stats['total_applications']}
-        - Parcelles analysées: {summary_stats['unique_plots']}
-        - Cultures analysées: {summary_stats['unique_crops']}
-        **Interprétation:**
-        - IFT < 1.0: Pression faible (adapté aux cultures sensibles)
-        - IFT 1.0-2.0: Pression modérée
-        - IFT > 2.0: Pression élevée
-        """
-        # Create visualization
-        fig = analyzer.create_weed_pressure_visualization(years=year_list, plots=plot_list)
-        return analysis_text, fig
-    except Exception as e:
-        return f"Erreur lors de l'analyse de pression: {str(e)}", None
-def predict_future_weed_pressure(target_years, max_ift):
-    """Predict weed pressure for future years."""
-    try:
-        # Convert target years
-        year_list = [int(y) for y in target_years] if target_years else [2025, 2026, 2027]
-        # Get predictions
-        predictions = analyzer.predict_weed_pressure(target_years=year_list)
-        # Format results
-        model_perf = predictions['model_performance']
-        results_text = f"""
-        ## Prédiction de la Pression Adventices
-        **Performance du modèle:**
-        - R² Score: {model_perf['r2']:.3f}
-        - Erreur quadratique moyenne: {model_perf['mse']:.3f}
-        **Prédictions par année:**
-        """
-        # Add predictions for each year
-        prediction_data = []
-        for year in year_list:
-            if year in predictions['predictions']:
-                year_pred = predictions['predictions'][year]
-                results_text += f"\n**{year}:**\n"
-                for _, row in year_pred.iterrows():
-                    results_text += f"- {row['plot_name']}: IFT {row['predicted_ift']:.2f} (Risque: {row['risk_level']})\n"
-                    prediction_data.append({
-                        'Année': year,
-                        'Parcelle': row['plot_name'],
-                        'IFT_Prédit': row['predicted_ift'],
-                        'Niveau_Risque': row['risk_level']
-                    })
-        # Identify suitable plots
-        suitable_plots = analyzer.identify_suitable_plots_for_sensitive_crops(
-            target_years=year_list,
-            max_ift_threshold=max_ift
-        )
-        results_text += f"\n\n**Parcelles adaptées aux cultures sensibles (IFT < {max_ift}):**\n"
-        for year, plots in suitable_plots.items():
-            if plots:
-                results_text += f"- {year}: {', '.join(plots)}\n"
-            else:
-                results_text += f"- {year}: Aucune parcelle adaptée\n"
-        # Create visualization
-        if prediction_data:
-            pred_df = pd.DataFrame(prediction_data)
-            fig = px.scatter(
-                pred_df,
-                x='Année',
-                y='IFT_Prédit',
-                color='Niveau_Risque',
-                size='IFT_Prédit',
-                hover_data=['Parcelle'],
-                title="Prédictions IFT par Parcelle et Année",
-                color_discrete_map={'low': 'green', 'medium': 'orange', 'high': 'red'}
-            )
-            fig.add_hline(y=max_ift, line_dash="dash", line_color="red",
-                         annotation_text=f"Seuil cultures sensibles ({max_ift})")
-            return results_text, fig
-        else:
-            return results_text, None
-    except Exception as e:
-        return f"Erreur lors de la prédiction: {str(e)}", None
-def analyze_crop_rotation():
-    """Analyze crop rotation impact."""
-    try:
-        rotation_impact = analyzer.analyze_crop_rotation_impact()
-        if rotation_impact.empty:
-            return "Pas assez de données pour analyser les rotations", None
-        analysis_text = f"""
-        ## Impact des Rotations sur la Pression Adventices
-        **Rotations les plus favorables (IFT moyen le plus bas):**
-        """
-        # Show top 10 best rotations
-        best_rotations = rotation_impact.head(10)
-        for _, row in best_rotations.iterrows():
-            analysis_text += f"\n- **{row['rotation_type']}**"
-            analysis_text += f"\n  - IFT moyen: {row['mean_ift']:.2f}"
-            analysis_text += f"\n  - Écart-type: {row['std_ift']:.2f}"
-            analysis_text += f"\n  - Observations: {row['count']}\n"
-        # Create visualization
-        top_20 = rotation_impact.head(20)
-        fig = px.bar(
-            top_20,
-            x='mean_ift',
-            y='rotation_type',
-            orientation='h',
-            title="Impact des Rotations sur l'IFT Herbicide (Top 20)",
-            labels={'mean_ift': 'IFT Moyen', 'rotation_type': 'Type de Rotation'},
-            color='mean_ift',
-            color_continuous_scale='RdYlGn_r'
-        )
-        fig.update_layout(height=800)
-        return analysis_text, fig
-    except Exception as e:
-        return f"Erreur lors de l'analyse des rotations: {str(e)}", None
-def analyze_herbicide_usage():
-    """Analyze herbicide usage patterns."""
-    try:
-        herbicide_analysis = analyzer.analyze_herbicide_alternatives()
-        analysis_text = f"""
-        ## Analyse des Herbicides Utilisés
-        **Herbicides les plus utilisés:**
-        """
-        top_herbicides = herbicide_analysis.head(15)
-        for _, row in top_herbicides.iterrows():
-            analysis_text += f"\n- **{row['produit']}** ({row['crop_type']})"
-            analysis_text += f"\n  - Applications: {row['applications']}"
-            analysis_text += f"\n  - Quantité totale: {row['total_quantity']:.1f}"
-            analysis_text += f"\n  - Quantité moyenne: {row['avg_quantity']:.1f}"
-            if not pd.isna(row['amm_code']):
-                analysis_text += f"\n  - Code AMM: {row['amm_code']}"
-            analysis_text += "\n"
-        # Create visualization
-        fig = px.bar(
-            top_herbicides.head(10),
-            x='applications',
-            y='produit',
-            orientation='h',
-            title="Herbicides les Plus Utilisés (Nombre d'Applications)",
-            labels={'applications': 'Nombre d\'Applications', 'produit': 'Produit'},
-            color='applications'
-        )
-        fig.update_layout(height=600)
-        return analysis_text, fig
-    except Exception as e:
-        return f"Erreur lors de l'analyse des herbicides: {str(e)}", None
-# Create Gradio interface
-def create_gradio_app():
-    """Create the Gradio application."""
-    # Load data for dropdowns
-    try:
-        df = load_initial_data()
-        available_years = sorted(df['year'].unique()) if not df.empty else []
-        available_plots = sorted(df['plot_name'].unique()) if not df.empty else []
-        available_crops = sorted(df['crop_type'].unique()) if not df.empty else []
-    except Exception:
-        available_years = []
-        available_plots = []
-        available_crops = []
-    with gr.Blocks(title="🚜 Analyse Agricole - Station de Kerguéhennec", theme=gr.themes.Soft()) as app:
-        gr.Markdown("""
-        # 🚜 Analyse des Données Agricoles
-        ## Station Expérimentale de Kerguéhennec
-        ### Outil d'aide à la décision pour la réduction des herbicides et l'identification des parcelles adaptées aux cultures sensibles
-        """)
-        with gr.Tabs():
-            # Tab 1: Data Overview
-            with gr.Tab("📊 Aperçu des Données"):
-                gr.Markdown("## Résumé des données disponibles")
-                summary_output = gr.Markdown(value=get_data_summary())
-                refresh_btn = gr.Button("🔄 Actualiser", variant="secondary")
-                refresh_btn.click(get_data_summary, outputs=summary_output)
-            # Tab 2: Data Filtering
-            with gr.Tab("🔍 Filtrage et Exploration"):
-                gr.Markdown("## Filtrer et explorer les données")
-                with gr.Row():
-                    with gr.Column():
-                        years_filter = gr.CheckboxGroup(
-                            choices=[str(y) for y in available_years],
-                            label="Années",
-                            value=[str(y) for y in available_years[-3:]] if available_years else []
-                        )
-                        plots_filter = gr.CheckboxGroup(
-                            choices=available_plots,
-                            label="Parcelles",
-                            value=available_plots[:5] if available_plots else []
-                        )
-                        crops_filter = gr.CheckboxGroup(
-                            choices=available_crops,
-                            label="Cultures",
-                            value=available_crops[:5] if available_crops else []
-                        )
-                        analyze_btn = gr.Button("📈 Analyser", variant="primary")
-                with gr.Column():
-                    filter_results = gr.Markdown()
-                    filter_plot = gr.Plot()
-                analyze_btn.click(
-                    filter_and_analyze_data,
-                    inputs=[years_filter, plots_filter, crops_filter],
-                    outputs=[filter_results, filter_plot]
-                )
-            # Tab 3: Weed Pressure Analysis
-            with gr.Tab("🌿 Pression Adventices"):
-                gr.Markdown("## Analyse de la pression adventices (IFT Herbicides)")
-                with gr.Row():
-                    with gr.Column():
-                        years_pressure = gr.CheckboxGroup(
-                            choices=[str(y) for y in available_years],
-                            label="Années à analyser",
-                            value=[str(y) for y in available_years] if available_years else []
-                        )
-                        plots_pressure = gr.CheckboxGroup(
-                            choices=available_plots,
-                            label="Parcelles à analyser",
-                            value=available_plots if len(available_plots) <= 10 else available_plots[:10]
-                        )
-                        pressure_btn = gr.Button("🔬 Analyser la Pression", variant="primary")
-                with gr.Column():
-                    pressure_results = gr.Markdown()
-                    pressure_plot = gr.Plot()
-                pressure_btn.click(
-                    analyze_weed_pressure,
-                    inputs=[years_pressure, plots_pressure],
-                    outputs=[pressure_results, pressure_plot]
-                )
-            # Tab 4: Predictions
-            with gr.Tab("🔮 Prédictions"):
-                gr.Markdown("## Prédiction de la pression adventices")
-                with gr.Row():
-                    with gr.Column():
-                        target_years = gr.CheckboxGroup(
-                            choices=["2025", "2026", "2027"],
-                            label="Années à prédire",
-                            value=["2025", "2026", "2027"]
-                        )
-                        max_ift = gr.Slider(
-                            minimum=0.5,
-                            maximum=3.0,
-                            value=1.0,
-                            step=0.1,
-                            label="Seuil IFT max pour cultures sensibles"
-                        )
-                        predict_btn = gr.Button("🎯 Prédire", variant="primary")
-                with gr.Column():
-                    prediction_results = gr.Markdown()
-                    prediction_plot = gr.Plot()
-                predict_btn.click(
-                    predict_future_weed_pressure,
-                    inputs=[target_years, max_ift],
-                    outputs=[prediction_results, prediction_plot]
-                )
-            # Tab 5: Crop Rotation
-            with gr.Tab("🔄 Rotations"):
-                gr.Markdown("## Impact des rotations culturales")
-                rotation_btn = gr.Button("📊 Analyser les Rotations", variant="primary")
-                rotation_results = gr.Markdown()
-                rotation_plot = gr.Plot()
-                rotation_btn.click(
-                    analyze_crop_rotation,
-                    outputs=[rotation_results, rotation_plot]
-                )
-            # Tab 6: Herbicide Analysis
-            with gr.Tab("💊 Herbicides"):
-                gr.Markdown("## Analyse des herbicides utilisés")
-                herbicide_btn = gr.Button("🧪 Analyser les Herbicides", variant="primary")
-                herbicide_results = gr.Markdown()
-                herbicide_plot = gr.Plot()
-                herbicide_btn.click(
-                    analyze_herbicide_usage,
-                    outputs=[herbicide_results, herbicide_plot]
-                )
-        gr.Markdown("""
-        ---
-        **Note:** Cet outil utilise les données historiques d'interventions de la Station Expérimentale de Kerguéhennec
-        pour analyser la pression adventices et identifier les parcelles les plus adaptées aux cultures sensibles
-        comme le pois et le haricot.
-        """)
-    return app
-# Launch the app
-if __name__ == "__main__":
-    app = create_gradio_app()
-    app.launch(
-        server_name="0.0.0.0",
-        server_port=7860,
-        share=True,
-        debug=True
-    )
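
Each callback in `gradio_app.py` calls `load_initial_data()` again, and nothing in that function memoizes the result, so every button click may trigger a fresh load. One possible way to load only once is `functools.lru_cache`. The sketch below is an assumption-laden illustration: `_FakeLoader` is a hypothetical stand-in for `AgriculturalDataLoader` so the example runs without the dataset, and rows are plain dicts rather than a DataFrame.

```python
from functools import lru_cache


class _FakeLoader:
    """Hypothetical stand-in for AgriculturalDataLoader; counts real loads."""

    def __init__(self) -> None:
        self.calls = 0

    def load_all_files(self) -> list:
        self.calls += 1
        return [{"plot_name": "P1", "year": 2024}]


data_loader = _FakeLoader()


@lru_cache(maxsize=1)
def load_initial_data() -> tuple:
    """Load once; later calls return the cached result."""
    # A tuple keeps callers from appending to or reordering the cached rows.
    return tuple(data_loader.load_all_files())


if __name__ == "__main__":
    load_initial_data()
    load_initial_data()
    print(data_loader.calls)  # number of real loads performed
```

Two caveats apply if this were adapted to the real loader: a cached DataFrame is shared between callers, so it should be copied before mutation, and a fallback value returned after a failed load would be cached as well unless `load_initial_data.cache_clear()` is called.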

hf_integration.py DELETED Viewed

@@ -1,313 +0,0 @@
-"""
-Hugging Face integration for dataset management and model deployment.
-"""
-import os
-import pandas as pd
-from datasets import Dataset, DatasetDict
-from huggingface_hub import HfApi, create_repo, upload_file
-from pathlib import Path
-from typing import Optional, Dict, Any
-import json
-class HuggingFaceIntegration:
-    """Handles Hugging Face dataset and model operations."""
-    def __init__(self, token: Optional[str] = None, dataset_id: str = "HackathonCRA/2024"):
-        self.token = token or os.environ.get("HF_TOKEN")
-        self.dataset_id = dataset_id
-        self.api = HfApi(token=self.token) if self.token else None
-    def prepare_dataset_from_local_files(self, data_path: str) -> Dataset:
-        """Prepare dataset from local CSV/Excel files."""
-        from data_loader import AgriculturalDataLoader
-        # Load and combine all data files
-        loader = AgriculturalDataLoader(data_path=data_path)
-        df = loader.load_all_files()
-        # Convert to Hugging Face Dataset
-        dataset = Dataset.from_pandas(df)
-        return dataset
-    def upload_dataset(self, data_path: str, private: bool = False) -> str:
-        """Upload agricultural data to Hugging Face Hub."""
-        if not self.token:
-            raise ValueError("HF_TOKEN required for uploading")
-        # Prepare dataset
-        dataset = self.prepare_dataset_from_local_files(data_path)
-        # Create repository if it doesn't exist
-        try:
-            create_repo(
-                repo_id=self.dataset_id,
-                token=self.token,
-                repo_type="dataset",
-                private=private,
-                exist_ok=True
-            )
-        except Exception as e:
-            print(f"Could not create or access repository: {e}")
-        # Upload dataset
-        dataset.push_to_hub(
-            repo_id=self.dataset_id,
-            token=self.token,
-            private=private
-        )
-        return f"Dataset uploaded to https://huggingface.co/datasets/{self.dataset_id}"
-    def create_dataset_card(self) -> str:
-        """Create a dataset card for the agricultural data."""
-        card_content = """
----
-license: cc-by-4.0
-task_categories:
-- tabular-regression
-- time-series-forecasting
-language:
-- fr
-tags:
-- agriculture
-- herbicides
-- weed-pressure
-- crop-rotation
-- france
-- bretagne
-size_categories:
-- 1K<n<10K
----
-# 🚜 Station Expérimentale de Kerguéhennec - Agricultural Interventions Dataset
-## Dataset Description
-This dataset contains agricultural intervention records from the Station Expérimentale de Kerguéhennec in Brittany, France, spanning from 2014 to 2024. The data includes detailed information about agricultural practices, crop rotations, herbicide treatments, and field management operations.
-## Dataset Summary
-- **Source**: Station Expérimentale de Kerguéhennec
-- **Time Period**: 2014-2024
-- **Location**: Brittany, France
-- **Records**: 4,663 intervention records
-- **Format**: CSV/Excel exports from farm management system
-## Use Cases
-This dataset is particularly valuable for:
-1. **Weed Pressure Analysis**: Calculate and predict Treatment Frequency Index (IFT) for herbicides
-2. **Crop Rotation Optimization**: Analyze the impact of different crop sequences on pest pressure
-3. **Sustainable Agriculture**: Support reduction of herbicide use while maintaining productivity
-4. **Precision Agriculture**: Identify suitable plots for sensitive crops (peas, beans)
-5. **Agricultural Research**: Study relationships between practices and outcomes
-## Data Fields
-### Core Fields
-- `millesime`: Year of intervention
-- `nomparc`: Plot/field name
-- `surfparc`: Plot surface area (hectares)
-- `libelleusag`: Crop type/usage
-- `datedebut`/`datefin`: Intervention start/end dates
-- `libevenem`: Intervention type
-- `familleprod`: Product family (herbicides, fungicides, etc.)
-- `produit`: Specific product used
-- `quantitetot`: Total quantity applied
-- `unite`: Unit of measurement
-### Derived Fields
-- `year`: Intervention year
-- `crop_type`: Standardized crop classification
-- `is_herbicide`: Boolean flag for herbicide treatments
-- `ift_herbicide`: Treatment Frequency Index calculation
-## Data Quality
-- All personal identifying information has been removed
-- Geographic coordinates are generalized to protect farm location
-- Product codes (AMM) are preserved for regulatory analysis
-- Missing values are clearly marked and documented
-## Methodology
-### IFT Calculation
-The Treatment Frequency Index (IFT) is calculated as:
-```
-IFT = Number of applications / Plot surface area
-```
-This metric is crucial for:
-- Regulatory compliance monitoring
-- Sustainable practice assessment
-- Risk evaluation for sensitive crops
-## Applications
-### 1. Weed Pressure Prediction
-Use machine learning models to predict future IFT values based on:
-- Historical treatment patterns
-- Crop rotation sequences
-- Environmental factors
-- Plot characteristics
-### 2. Sustainable Plot Selection
-Identify plots suitable for sensitive crops (peas, beans) by:
-- Analyzing historical IFT trends
-- Evaluating rotation impacts
-- Assessing risk levels
-### 3. Alternative Strategy Development
-Support herbicide reduction strategies through:
-- Product usage pattern analysis
-- Rotation optimization recommendations
-- Risk assessment frameworks
-## Citation
-If you use this dataset in your research, please cite:
-```
-@dataset{hackathon_cra_2024,
-  title={Station Expérimentale de Kerguéhennec Agricultural Interventions Dataset},
-  author={Hackathon CRA Team},
-  year={2024},
-  publisher={Hugging Face},
-  url={https://huggingface.co/datasets/HackathonCRA/2024}
-}
-```
-## License
-This dataset is released under CC-BY-4.0 license, allowing for both commercial and research use with proper attribution.
-## Contact
-For questions about this dataset or collaboration opportunities, please contact the research team through the Hugging Face dataset page.
----
-**Keywords**: agriculture, herbicides, crop rotation, sustainable farming, France, Brittany, IFT, weed management, precision agriculture
-"""
-        return card_content
-    def upload_app_space(self, local_app_path: str, space_name: str = "agricultural-analysis") -> str:
-        """Upload the Gradio app as a Hugging Face Space."""
-        if not self.token:
-            raise ValueError("HF_TOKEN required for uploading")
-        repo_id = f"{self.api.whoami()['name']}/{space_name}"
-        # Create Space repository
-        try:
-            create_repo(
-                repo_id=repo_id,
-                token=self.token,
-                repo_type="space",
-                space_sdk="gradio",
-                private=False,
-                exist_ok=True
-            )
-        except Exception as e:
-            print(f"Could not create or access Space: {e}")
-        # Upload files
-        app_files = [
-            "app.py",
-            "requirements.txt",
-            "gradio_app.py",
-            "data_loader.py",
-            "analysis_tools.py",
-            "mcp_server.py",
-            "README.md"
-        ]
-        for file_name in app_files:
-            file_path = Path(local_app_path) / file_name
-            if file_path.exists():
-                upload_file(
-                    path_or_fileobj=str(file_path),
-                    path_in_repo=file_name,
-                    repo_id=repo_id,
-                    repo_type="space",
-                    token=self.token
-                )
-                print(f"Uploaded {file_name}")
-        return f"Space created at https://huggingface.co/spaces/{repo_id}"
-    def create_space_readme(self) -> str:
-        """Create README for Hugging Face Space."""
-        readme_content = """
----
-title: Agricultural Analysis - Kerguéhennec
-emoji: 🚜
-colorFrom: green
-colorTo: blue
-sdk: gradio
-sdk_version: 4.0.0
-app_file: app.py
-pinned: false
-license: cc-by-4.0
----
-# 🚜 Agricultural Analysis - Station de Kerguéhennec
-Agricultural data analysis tool for optimizing plant-protection practices and identifying plots suited to sensitive crops.
-## Features
-- 📊 Analysis of agricultural intervention data
-- 🌿 Weed pressure assessment (IFT)
-- 🔮 Predictions for the next 3 years
-- 🔄 Analysis of the impact of crop rotations
-- 💊 Study of the herbicides used
-- 🎯 Identification of plots for sensitive crops
-## Usage
-1. Select the tab corresponding to your analysis
-2. Configure the filters to suit your needs
-3. Run the analysis to get the results
-4. Explore the interactive visualizations
-## Data
-Based on data from the Station Expérimentale de Kerguéhennec (2014-2024).
-"""
-        return readme_content
-    def setup_environment_variables(self) -> Dict[str, str]:
-        """Setup environment variables for Hugging Face deployment."""
-        env_vars = {
-            "HF_TOKEN": self.token or "your_hf_token_here",
-            "DATASET_ID": self.dataset_id,
-            "GRADIO_SERVER_NAME": "0.0.0.0",
-            "GRADIO_SERVER_PORT": "7860"
-        }
-        return env_vars
-# Usage example
-if __name__ == "__main__":
-    # Initialize HF integration
-    hf = HuggingFaceIntegration()
-    # Upload dataset (requires HF_TOKEN)
-    if hf.token:
-        try:
-            result = hf.upload_dataset("/Users/tracyandre/Downloads/OneDrive_1_9-17-2025")
-            print(result)
-        except Exception as e:
-            print(f"Dataset upload failed: {e}")
-    # Create dataset card
-    card = hf.create_dataset_card()
-    print("Dataset card created")
-    # Show environment setup
-    env_vars = hf.setup_environment_variables()
-    print("Environment variables:", env_vars)
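
Review note on the removed `__main__` block: it prints the dictionary returned by `setup_environment_variables`, and that dictionary carries the raw `HF_TOKEN` value. A minimal sketch of masking secrets before logging follows; `redact_env` and `SECRET_KEYS` are hypothetical names introduced here for illustration and are not part of the original file.

```python
from typing import Dict

# Hypothetical set of keys whose values should never be printed or logged.
SECRET_KEYS = {"HF_TOKEN"}


def redact_env(env_vars: Dict[str, str]) -> Dict[str, str]:
    """Return a copy of env_vars with non-empty secret values masked."""
    return {
        key: "***" if key in SECRET_KEYS and value else value
        for key, value in env_vars.items()
    }


if __name__ == "__main__":
    # Made-up values, only to show the masking.
    example = {"HF_TOKEN": "hf_example", "GRADIO_SERVER_PORT": "7860"}
    print("Environment variables:", redact_env(example))
```

The original dictionary is left untouched, so it can still be passed to whatever consumes the real values.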

hf_usage_example.py DELETED

@@ -1,214 +0,0 @@
-#!/usr/bin/env python3
-"""
-Example usage of the agricultural data loader with Hugging Face integration.
-Shows different ways to load and use the data.
-"""
-import os
-import warnings
-warnings.filterwarnings('ignore')
-from data_loader import AgriculturalDataLoader
-from analysis_tools import AgriculturalAnalyzer
-def example_local_usage():
-    """Example: Load from local files."""
-    print("📁 EXAMPLE 1: Loading from local files")
-    print("-" * 40)
-    # Create loader for local files
-    loader = AgriculturalDataLoader.create_local_loader(
-        data_path="/Users/tracyandre/Downloads/OneDrive_1_9-17-2025"
-    )
-    # Load and analyze data
-    df = loader.load_all_files()
-    print(f"✅ Loaded {len(df):,} records from local files")
-    # Basic analysis
-    analyzer = AgriculturalAnalyzer(loader)
-    trends = analyzer.analyze_weed_pressure_trends()
-    print(f"📊 Average IFT: {trends['summary']['mean_ift']:.2f}")
-    return df
-def example_hf_usage():
-    """Example: Load from Hugging Face (if available)."""
-    print("\n🤗 EXAMPLE 2: Loading from Hugging Face")
-    print("-" * 40)
-    # Check if HF token is available
-    if not os.environ.get("HF_TOKEN"):
-        print("⚠️  No HF_TOKEN found - skipping HF example")
-        print("💡 Set HF_TOKEN environment variable to use this feature")
-        return None
-    try:
-        # Create loader for Hugging Face
-        loader = AgriculturalDataLoader.create_hf_loader(
-            dataset_id="HackathonCRA/2024"
-        )
-        # Load and analyze data
-        df = loader.load_all_files()
-        print(f"✅ Loaded {len(df):,} records from Hugging Face")
-        # Basic analysis
-        analyzer = AgriculturalAnalyzer(loader)
-        trends = analyzer.analyze_weed_pressure_trends()
-        print(f"📊 Average IFT: {trends['summary']['mean_ift']:.2f}")
-        return df
-    except Exception as e:
-        print(f"❌ Failed to load from Hugging Face: {e}")
-        return None
-def example_automatic_fallback():
-    """Example: Automatic fallback from HF to local."""
-    print("\n🔄 EXAMPLE 3: Automatic fallback")
-    print("-" * 40)
-    # Create loader with HF preferred but local fallback
-    loader = AgriculturalDataLoader(
-        data_path="/Users/tracyandre/Downloads/OneDrive_1_9-17-2025",
-        dataset_id="HackathonCRA/2024",
-        use_hf=True  # Try HF first
-    )
-    # This will try HF first, then fallback to local if needed
-    df = loader.load_all_files()
-    print(f"✅ Loaded {len(df):,} records (with automatic source selection)")
-    return df
-def example_dynamic_switching():
-    """Example: Dynamic switching between sources."""
-    print("\n🔀 EXAMPLE 4: Dynamic source switching")
-    print("-" * 40)
-    # Create loader
-    loader = AgriculturalDataLoader(
-        data_path="/Users/tracyandre/Downloads/OneDrive_1_9-17-2025",
-        dataset_id="HackathonCRA/2024"
-    )
-    # Load from local first
-    loader.set_data_source(use_hf=False)
-    df_local = loader.load_all_files()
-    print(f"📁 Local source: {len(df_local):,} records")
-    # Switch to HF (if available)
-    if os.environ.get("HF_TOKEN"):
-        try:
-            loader.set_data_source(use_hf=True)
-            df_hf = loader.load_all_files()
-            print(f"🤗 HF source: {len(df_hf):,} records")
-            # Compare
-            if len(df_local) == len(df_hf):
-                print("✅ Data consistency verified")
-            else:
-                print(f"⚠️  Data mismatch: {abs(len(df_local) - len(df_hf))} record difference")
-        except Exception as e:
-            print(f"🤗 HF switching failed: {e}")
-    else:
-        print("⚠️  No HF_TOKEN - skipping HF switch test")
-    return df_local
-def example_production_deployment():
-    """Example: Production deployment configuration."""
-    print("\n🚀 EXAMPLE 5: Production deployment setup")
-    print("-" * 40)
-    # Production configuration
-    # This is how you'd set it up for Hugging Face Spaces deployment
-    print("💡 For Hugging Face Spaces deployment:")
-    print("1. Set HF_TOKEN as a Space secret")
-    print("2. Configure the loader as follows:")
-    print()
-    config_code = '''
-# In your app.py or gradio_app.py
-import os
-from data_loader import AgriculturalDataLoader
-# Production configuration
-hf_token = os.environ.get("HF_TOKEN")
-dataset_id = "HackathonCRA/2024"
-if hf_token:
-    # Use HF dataset in production
-    data_loader = AgriculturalDataLoader.create_hf_loader(
-        dataset_id=dataset_id,
-        hf_token=hf_token
-    )
-    print("🤗 Using Hugging Face dataset")
-else:
-    # Fallback for local development
-    data_loader = AgriculturalDataLoader.create_local_loader(
-        data_path="./data"  # Local data directory
-    )
-    print("📁 Using local files")
-'''
-    print(config_code)
-    # Example of actual production setup
-    try:
-        hf_token = os.environ.get("HF_TOKEN")
-        if hf_token:
-            loader = AgriculturalDataLoader.create_hf_loader("HackathonCRA/2024", hf_token)
-            print("✅ Production setup: HF dataset configured")
-        else:
-            loader = AgriculturalDataLoader.create_local_loader("/Users/tracyandre/Downloads/OneDrive_1_9-17-2025")
-            print("✅ Development setup: Local files configured")
-        df = loader.load_all_files()
-        print(f"📊 Ready for production: {len(df):,} records available")
-    except Exception as e:
-        print(f"❌ Production setup failed: {e}")
-def main():
-    """Run all examples."""
-    print("🚜 AGRICULTURAL DATA LOADER - USAGE EXAMPLES")
-    print("=" * 60)
-    # Run examples
-    example_local_usage()
-    example_hf_usage()
-    example_automatic_fallback()
-    example_dynamic_switching()
-    example_production_deployment()
-    print("\n" + "=" * 60)
-    print("🎯 SUMMARY")
-    print("=" * 60)
-    print("""
-The AgriculturalDataLoader now supports:
-✅ Local file loading (CSV/Excel)
-✅ Hugging Face dataset loading
-✅ Automatic fallback (HF → Local)
-✅ Dynamic source switching
-✅ Production deployment ready
-Key benefits:
-🔄 Seamless data source switching
-🚀 Cloud deployment ready
-📊 Same analysis tools work with both sources
-🔧 Easy configuration management
-    """)
-    print("🛠️  Next steps:")
-    print("1. Upload your dataset to Hugging Face Hub")
-    print("2. Set HF_TOKEN environment variable")
-    print("3. Deploy to Hugging Face Spaces")
-    print("4. Enjoy cloud-based agricultural analysis!")
-if __name__ == "__main__":
-    main()

launch.py DELETED

@@ -1,170 +0,0 @@
-#!/usr/bin/env python3
-"""
-Launch script for the Agricultural Analysis Tool
-Simple launcher with menu options for different modes.
-"""
-import sys
-import os
-import subprocess
-import warnings
-warnings.filterwarnings('ignore')
-def print_banner():
-    """Print the application banner."""
-    print("🚜" + "="*70)
-    print("    AGRICULTURAL ANALYSIS TOOL - STATION DE KERGUÉHENNEC")
-    print("    Hackathon CRA - Réduction des herbicides")
-    print("="*73)
-    print()
-def check_dependencies():
-    """Check if all required dependencies are installed."""
-    print("🔧 Checking dependencies...")
-    try:
-        import pandas, numpy, matplotlib, seaborn, sklearn, gradio, plotly
-        from data_loader import AgriculturalDataLoader
-        from analysis_tools import AgriculturalAnalyzer
-        print("✅ All dependencies are installed")
-        return True
-    except ImportError as e:
-        print(f"❌ Missing dependency: {e}")
-        print("Please run: pip install -r requirements.txt")
-        return False
-def test_data_loading():
-    """Test if data can be loaded successfully."""
-    print("📊 Testing data loading...")
-    try:
-        from data_loader import AgriculturalDataLoader
-        loader = AgriculturalDataLoader()
-        df = loader.load_all_files()
-        print(f"✅ Successfully loaded {len(df):,} records")
-        return True
-    except Exception as e:
-        print(f"❌ Data loading failed: {e}")
-        return False
-def launch_gradio():
-    """Launch the Gradio interface."""
-    print("🚀 Launching Gradio interface...")
-    print("📱 The app will open in your web browser")
-    print("🌐 Access at: http://localhost:7860")
-    print("⏹️  Press Ctrl+C to stop the server")
-    print()
-    try:
-        from gradio_app import create_gradio_app
-        app = create_gradio_app()
-        app.launch(
-            server_name="0.0.0.0",
-            server_port=7860,
-            share=False,
-            debug=False,
-            quiet=False
-        )
-    except KeyboardInterrupt:
-        print("\n🛑 Server stopped by user")
-    except Exception as e:
-        print(f"❌ Failed to launch Gradio: {e}")
-def launch_mcp_server():
-    """Launch the MCP server."""
-    print("🤖 Launching MCP Server...")
-    print("📡 Server will run in Model Context Protocol mode")
-    print("⏹️  Press Ctrl+C to stop the server")
-    print()
-    try:
-        subprocess.run([sys.executable, "mcp_server.py"])
-    except KeyboardInterrupt:
-        print("\n🛑 MCP Server stopped by user")
-    except Exception as e:
-        print(f"❌ Failed to launch MCP server: {e}")
-def run_demo():
-    """Run the demonstration."""
-    print("🎬 Running comprehensive demo...")
-    print()
-    try:
-        subprocess.run([sys.executable, "demo.py"])
-    except Exception as e:
-        print(f"❌ Demo failed: {e}")
-def show_menu():
-    """Show the main menu."""
-    print("📋 Choose an option:")
-    print()
-    print("1. 🌐 Launch Gradio Web Interface (Recommended)")
-    print("2. 🤖 Launch MCP Server")
-    print("3. 🎬 Run Demo")
-    print("4. 🔧 Check System Status")
-    print("5. ❌ Exit")
-    print()
-def main():
-    """Main launcher function."""
-    print_banner()
-    # Check dependencies first
-    if not check_dependencies():
-        return
-    # Test data loading
-    if not test_data_loading():
-        return
-    print("🎯 System ready!")
-    print()
-    while True:
-        show_menu()
-        try:
-            choice = input("Enter your choice (1-5): ").strip()
-            if choice == "1":
-                print()
-                launch_gradio()
-                print()
-            elif choice == "2":
-                print()
-                launch_mcp_server()
-                print()
-            elif choice == "3":
-                print()
-                run_demo()
-                print()
-                input("Press Enter to continue...")
-                print()
-            elif choice == "4":
-                print()
-                print("🔍 System Status Check:")
-                check_dependencies()
-                test_data_loading()
-                print()
-                input("Press Enter to continue...")
-                print()
-            elif choice == "5":
-                print()
-                print("👋 Goodbye! Thank you for using the Agricultural Analysis Tool")
-                break
-            else:
-                print("❌ Invalid choice. Please enter a number between 1-5.")
-                print()
-        except KeyboardInterrupt:
-            print("\n\n👋 Goodbye! Thank you for using the Agricultural Analysis Tool")
-            break
-        except Exception as e:
-            print(f"❌ Error: {e}")
-            print()
-if __name__ == "__main__":
-    main()
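
Review note: the removed `check_dependencies` imports every package in one statement, so it stops at the first `ImportError` and reports only that one. A sketch of an alternative that lists every missing top-level module without importing them, using the standard library's `importlib.util.find_spec`; the helper name `find_missing_modules` is hypothetical.

```python
import importlib.util
from typing import Iterable, List


def find_missing_modules(module_names: Iterable[str]) -> List[str]:
    """Return the top-level module names that cannot be found on the import path."""
    return [
        name
        for name in module_names
        if importlib.util.find_spec(name) is None
    ]


if __name__ == "__main__":
    # Module list taken from the removed check_dependencies function.
    required = ["pandas", "numpy", "matplotlib", "seaborn", "sklearn", "gradio", "plotly"]
    missing = find_missing_modules(required)
    if missing:
        print("Missing dependencies:", ", ".join(missing))
    else:
        print("All dependencies are installed")
```

This only checks that a module can be located; it does not verify versions or that the import itself succeeds.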

mcp.code-workspace DELETED

@@ -1,11 +0,0 @@
-{
-	"folders": [
-		{
-			"path": "."
-		},
-		{
-			"path": "../../../Downloads/OneDrive_1_9-17-2025"
-		}
-	],
-	"settings": {}
-}

mcp_server.py CHANGED

@@ -1,433 +1,296 @@
-"""
-MCP Server for Agricultural Data Analysis
-Provides tools and resources for analyzing agricultural intervention data.
-"""
-import json
-import logging
-from typing import Any, Dict, List, Optional
-from mcp.server import Server
-from mcp.server.models import InitializationOptions
-from mcp.server.stdio import stdio_server
-from mcp.types import Resource, Tool, TextContent
-import asyncio
 import pandas as pd
 from data_loader import AgriculturalDataLoader
-from analysis_tools import AgriculturalAnalyzer
-import plotly.io as pio
-# Set up logging
-logging.basicConfig(level=logging.INFO)
-logger = logging.getLogger("agricultural-mcp-server")
-# Initialize data components
-data_loader = AgriculturalDataLoader()
-analyzer = AgriculturalAnalyzer(data_loader)
-# Create MCP server
-server = Server("agricultural-analysis")
-@server.list_resources()
-async def list_resources() -> List[Resource]:
-    """List available resources."""
-    return [
-        Resource(
-            uri="agricultural://data/summary",
-            name="Data Summary",
-            mimeType="application/json",
-            description="Summary of available agricultural intervention data"
-        ),
-        Resource(
-            uri="agricultural://data/years",
-            name="Available Years",
-            mimeType="application/json",
-            description="List of years with available data"
-        ),
-        Resource(
-            uri="agricultural://data/plots",
-            name="Available Plots",
-            mimeType="application/json",
-            description="List of available plots/parcels"
-        ),
-        Resource(
-            uri="agricultural://data/crops",
-            name="Available Crops",
-            mimeType="application/json",
-            description="List of available crop types"
-        ),
-        Resource(
-            uri="agricultural://analysis/weed-pressure",
-            name="Weed Pressure Analysis",
-            mimeType="application/json",
-            description="Current weed pressure trends analysis"
-        ),
-        Resource(
-            uri="agricultural://analysis/rotation-impact",
-            name="Crop Rotation Impact",
-            mimeType="application/json",
-            description="Analysis of crop rotation impact on weed pressure"
-        )
-    ]
-@server.read_resource()
-async def read_resource(uri: str) -> str:
-    """Read a specific resource."""
     try:
-        if uri == "agricultural://data/summary":
-            df = data_loader.load_all_files()
-            summary = {
-                "total_records": len(df),
-                "date_range": {
-                    "start": df['datedebut'].min().strftime('%Y-%m-%d') if df['datedebut'].min() else None,
-                    "end": df['datedebut'].max().strftime('%Y-%m-%d') if df['datedebut'].max() else None
-                },
-                "unique_plots": df['plot_name'].nunique(),
-                "unique_crops": df['crop_type'].nunique(),
-                "herbicide_applications": len(df[df['is_herbicide'] == True]),
-                "years_covered": sorted(df['year'].unique().tolist())
-            }
-            return json.dumps(summary, indent=2)
-        elif uri == "agricultural://data/years":
-            years = data_loader.get_years_available()
-            return json.dumps({"available_years": years})
-        elif uri == "agricultural://data/plots":
-            plots = data_loader.get_plots_available()
-            return json.dumps({"available_plots": plots})
-        elif uri == "agricultural://data/crops":
-            crops = data_loader.get_crops_available()
-            return json.dumps({"available_crops": crops})
-        elif uri == "agricultural://analysis/weed-pressure":
-            trends = analyzer.analyze_weed_pressure_trends()
-            # Convert DataFrames to dict for JSON serialization
-            serializable_trends = {}
-            for key, value in trends.items():
-                if isinstance(value, pd.DataFrame):
-                    serializable_trends[key] = value.to_dict('records')
-                else:
-                    serializable_trends[key] = value
-            return json.dumps(serializable_trends, indent=2)
-        elif uri == "agricultural://analysis/rotation-impact":
-            rotation_impact = analyzer.analyze_crop_rotation_impact()
-            return json.dumps(rotation_impact.to_dict('records'), indent=2)
         else:
-            raise ValueError(f"Unknown resource: {uri}")
     except Exception as e:
-        logger.error(f"Error reading resource {uri}: {e}")
-        return json.dumps({"error": str(e)})
-@server.list_tools()
-async def list_tools() -> List[Tool]:
-    """List available tools."""
-    return [
-        Tool(
-            name="filter_data",
-            description="Filter agricultural data by years, plots, crops, or intervention types",
-            inputSchema={
-                "type": "object",
-                "properties": {
-                    "years": {
-                        "type": "array",
-                        "items": {"type": "integer"},
-                        "description": "List of years to filter (e.g., [2022, 2023, 2024])"
-                    },
-                    "plots": {
-                        "type": "array",
-                        "items": {"type": "string"},
-                        "description": "List of plot names to filter"
-                    },
-                    "crops": {
-                        "type": "array",
-                        "items": {"type": "string"},
-                        "description": "List of crop types to filter"
-                    },
-                    "intervention_types": {
-                        "type": "array",
-                        "items": {"type": "string"},
-                        "description": "List of intervention types to filter"
-                    }
-                }
-            }
-        ),
-        Tool(
-            name="analyze_weed_pressure",
-            description="Analyze weed pressure trends based on herbicide usage (IFT)",
-            inputSchema={
-                "type": "object",
-                "properties": {
-                    "years": {
-                        "type": "array",
-                        "items": {"type": "integer"},
-                        "description": "Years to analyze"
-                    },
-                    "plots": {
-                        "type": "array",
-                        "items": {"type": "string"},
-                        "description": "Plots to analyze"
-                    },
-                    "include_visualization": {
-                        "type": "boolean",
-                        "description": "Whether to include visualization data",
-                        "default": True
-                    }
-                }
-            }
-        ),
-        Tool(
-            name="predict_weed_pressure",
-            description="Predict weed pressure for the next 3 years using machine learning",
-            inputSchema={
-                "type": "object",
-                "properties": {
-                    "target_years": {
-                        "type": "array",
-                        "items": {"type": "integer"},
-                        "description": "Years to predict (default: [2025, 2026, 2027])",
-                        "default": [2025, 2026, 2027]
-                    },
-                    "plots": {
-                        "type": "array",
-                        "items": {"type": "string"},
-                        "description": "Specific plots to predict for (optional)"
-                    }
-                }
-            }
-        ),
-        Tool(
-            name="identify_suitable_plots",
-            description="Identify plots suitable for sensitive crops (peas, beans) based on low weed pressure",
-            inputSchema={
-                "type": "object",
-                "properties": {
-                    "target_years": {
-                        "type": "array",
-                        "items": {"type": "integer"},
-                        "description": "Years to evaluate (default: [2025, 2026, 2027])",
-                        "default": [2025, 2026, 2027]
-                    },
-                    "max_ift_threshold": {
-                        "type": "number",
-                        "description": "Maximum IFT threshold for suitable plots (default: 1.0)",
-                        "default": 1.0
-                    }
-                }
-            }
-        ),
-        Tool(
-            name="analyze_crop_rotation",
-            description="Analyze the impact of crop rotation patterns on weed pressure",
-            inputSchema={
-                "type": "object",
-                "properties": {}
-            }
-        ),
-        Tool(
-            name="analyze_herbicide_alternatives",
-            description="Analyze herbicide usage patterns and identify most used products",
-            inputSchema={
-                "type": "object",
-                "properties": {}
-            }
-        ),
-        Tool(
-            name="get_data_statistics",
-            description="Get comprehensive statistics about the agricultural data",
-            inputSchema={
-                "type": "object",
-                "properties": {
-                    "years": {
-                        "type": "array",
-                        "items": {"type": "integer"},
-                        "description": "Years to analyze (optional)"
-                    },
-                    "plots": {
-                        "type": "array",
-                        "items": {"type": "string"},
-                        "description": "Plots to analyze (optional)"
-                    }
-                }
-            }
-        )
-    ]
-@server.call_tool()
-async def call_tool(name: str, arguments: Dict[str, Any]) -> List[TextContent]:
-    """Execute a tool call."""
     try:
-        if name == "filter_data":
-            df = data_loader.filter_data(
-                years=arguments.get("years"),
-                plots=arguments.get("plots"),
-                crops=arguments.get("crops"),
-                intervention_types=arguments.get("intervention_types")
-            )
-            result = {
-                "filtered_records": len(df),
-                "summary": {
-                    "unique_plots": df['plot_name'].nunique(),
-                    "unique_crops": df['crop_type'].nunique(),
-                    "year_range": [int(df['year'].min()), int(df['year'].max())] if len(df) > 0 else [],
-                    "herbicide_applications": len(df[df['is_herbicide'] == True])
-                },
-                "sample_data": df.head(10).to_dict('records') if len(df) > 0 else []
-            }
-            return [TextContent(
-                type="text",
-                text=json.dumps(result, indent=2, default=str)
-            )]
-        elif name == "analyze_weed_pressure":
-            trends = analyzer.analyze_weed_pressure_trends(
-                years=arguments.get("years"),
-                plots=arguments.get("plots")
-            )
-            # Convert DataFrames to dict for JSON serialization
-            serializable_trends = {}
-            for key, value in trends.items():
-                if isinstance(value, pd.DataFrame):
-                    serializable_trends[key] = value.to_dict('records')
-                else:
-                    serializable_trends[key] = value
-            # Include visualization if requested
-            if arguments.get("include_visualization", True):
-                try:
-                    fig = analyzer.create_weed_pressure_visualization(
-                        years=arguments.get("years"),
-                        plots=arguments.get("plots")
-                    )
-                    # Convert plot to HTML
-                    serializable_trends["visualization_html"] = pio.to_html(fig, include_plotlyjs=True)
-                except Exception as e:
-                    serializable_trends["visualization_error"] = str(e)
-            return [TextContent(
-                type="text",
-                text=json.dumps(serializable_trends, indent=2, default=str)
-            )]
-        elif name == "predict_weed_pressure":
-            predictions = analyzer.predict_weed_pressure(
-                target_years=arguments.get("target_years", [2025, 2026, 2027]),
-                plots=arguments.get("plots")
-            )
-            # Convert DataFrames to dict for JSON serialization
-            serializable_predictions = {}
-            for key, value in predictions.items():
-                if key == "predictions":
-                    serializable_predictions[key] = {}
-                    for year, df in value.items():
-                        serializable_predictions[key][year] = df.to_dict('records')
-                elif isinstance(value, pd.DataFrame):
-                    serializable_predictions[key] = value.to_dict('records')
-                else:
-                    serializable_predictions[key] = value
-            return [TextContent(
-                type="text",
-                text=json.dumps(serializable_predictions, indent=2, default=str)
-            )]
-        elif name == "identify_suitable_plots":
-            suitable_plots = analyzer.identify_suitable_plots_for_sensitive_crops(
-                target_years=arguments.get("target_years", [2025, 2026, 2027]),
-                max_ift_threshold=arguments.get("max_ift_threshold", 1.0)
-            )
-            return [TextContent(
-                type="text",
-                text=json.dumps(suitable_plots, indent=2)
-            )]
-        elif name == "analyze_crop_rotation":
-            rotation_impact = analyzer.analyze_crop_rotation_impact()
-            return [TextContent(
-                type="text",
-                text=json.dumps(rotation_impact.to_dict('records'), indent=2, default=str)
-            )]
-        elif name == "analyze_herbicide_alternatives":
-            herbicide_analysis = analyzer.analyze_herbicide_alternatives()
-            return [TextContent(
-                type="text",
-                text=json.dumps(herbicide_analysis.to_dict('records'), indent=2, default=str)
-            )]
-        elif name == "get_data_statistics":
-            df = data_loader.filter_data(
-                years=arguments.get("years"),
-                plots=arguments.get("plots")
-            )
-            stats = {
-                "general": {
-                    "total_records": len(df),
-                    "unique_plots": df['plot_name'].nunique(),
-                    "unique_crops": df['crop_type'].nunique(),
-                    "date_range": {
-                        "start": df['datedebut'].min().strftime('%Y-%m-%d') if not df['datedebut'].isna().all() else None,
-                        "end": df['datedebut'].max().strftime('%Y-%m-%d') if not df['datedebut'].isna().all() else None
-                    }
-                },
-                "interventions": {
-                    "total_herbicide": len(df[df['is_herbicide'] == True]),
-                    "total_fungicide": len(df[df['is_fungicide'] == True]),
-                    "total_insecticide": len(df[df['is_insecticide'] == True])
-                },
-                "top_crops": df['crop_type'].value_counts().head(10).to_dict(),
-                "top_plots": df['plot_name'].value_counts().head(10).to_dict(),
-                "yearly_distribution": df['year'].value_counts().sort_index().to_dict()
-            }
-            return [TextContent(
-                type="text",
-                text=json.dumps(stats, indent=2, default=str)
-            )]
-        else:
-            raise ValueError(f"Unknown tool: {name}")
-    except Exception as e:
-        logger.error(f"Error executing tool {name}: {e}")
-        return [TextContent(
-            type="text",
-            text=json.dumps({"error": str(e)}, indent=2)
-        )]
-async def main():
-    """Main function to run the MCP server."""
-    logger.info("Starting Agricultural MCP Server...")
-    # Initialize the server
-    async with stdio_server() as (read_stream, write_stream):
-        await server.run(
-            read_stream,
-            write_stream,
-            InitializationOptions(
-                server_name="agricultural-analysis",
-                server_version="1.0.0",
-                capabilities=server.get_capabilities()
-            )
-        )
 if __name__ == "__main__":
-    asyncio.run(main())
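
Review note: the rewritten `mcp_server.py` in this hunk predicts future weed pressure by fitting a straight line to each plot's historical IFT values with `np.polyfit`, clamping the extrapolated value at zero, and mapping it to a risk label using the 1.0 and 2.0 thresholds. The sketch below reproduces that logic in pure Python with the closed-form least-squares line, so it runs without NumPy. The function names are hypothetical and the sample numbers are made up for illustration.

```python
from typing import Sequence, Tuple


def fit_line(xs: Sequence[float], ys: Sequence[float]) -> Tuple[float, float]:
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need at least two (x, y) pairs of equal length")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("need at least two distinct x values")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def predict_ift(years: Sequence[float], ift_values: Sequence[float], target_year: float) -> float:
    """Extrapolate the fitted trend to target_year, never going below zero."""
    slope, intercept = fit_line(years, ift_values)
    return max(0.0, slope * target_year + intercept)


def risk_level(predicted_ift: float) -> str:
    """Map a predicted IFT to the labels used in the diff."""
    if predicted_ift < 1.0:
        return "Faible"
    if predicted_ift < 2.0:
        return "Modéré"
    return "Élevé"


if __name__ == "__main__":
    # Made-up history for a single plot.
    years = [2020, 2021, 2022]
    ift = [1.0, 1.5, 2.0]
    for target in (2025, 2026, 2027):
        value = predict_ift(years, ift, target)
        print(target, round(value, 2), risk_level(value))
```

A straight-line extrapolation over a handful of yearly points is sensitive to single outlier seasons, so the resulting labels are best read as a rough ranking of plots rather than a forecast.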

+"""MCP Server for Agricultural Weed Pressure Analysis"""
+import gradio as gr
 import pandas as pd
+import numpy as np
+import plotly.express as px
 from data_loader import AgriculturalDataLoader
+import warnings
+warnings.filterwarnings('ignore')
+class WeedPressureAnalyzer:
+    """Analyze weed pressure and recommend plots for sensitive crops."""
+    def __init__(self):
+        self.data_loader = AgriculturalDataLoader()
+        self.data_cache = None
+    def load_data(self):
+        if self.data_cache is None:
+            self.data_cache = self.data_loader.load_all_files()
+        return self.data_cache
+    def calculate_herbicide_ift(self, years=None):
+        """Calculate IFT for herbicides by plot and year."""
+        df = self.load_data()
+        if years:
+            df = df[df['year'].isin(years)]
+        herbicide_df = df[df['is_herbicide'] == True].copy()
+        if len(herbicide_df) == 0:
+            return pd.DataFrame()
+        ift_summary = herbicide_df.groupby(['plot_name', 'year', 'crop_type']).agg({
+            'produit': 'count',
+            'plot_surface': 'first',
+            'quantitetot': 'sum'
+        }).reset_index()
+        # Simplified proxy: herbicide applications per unit of plot surface, not a dose-based IFT.
+        ift_summary['ift_herbicide'] = ift_summary['produit'] / ift_summary['plot_surface']
+        return ift_summary
+    def predict_weed_pressure(self, target_years=(2025, 2026, 2027)):
+        """Predict weed pressure for future years."""
+        ift_data = self.calculate_herbicide_ift()
+        if len(ift_data) == 0:
+            return pd.DataFrame()
+        predictions = []
+        for plot in ift_data['plot_name'].unique():
+            plot_data = ift_data[ift_data['plot_name'] == plot].sort_values('year')
+            if len(plot_data) < 2:
+                continue
+            years = plot_data['year'].values
+            ift_values = plot_data['ift_herbicide'].values
+            if len(years) > 1:
+                slope, intercept = np.polyfit(years, ift_values, 1)
+                for target_year in target_years:
+                    predicted_ift = slope * target_year + intercept
+                    predicted_ift = max(0, predicted_ift)
+                    if predicted_ift < 1.0:
+                        risk_level = "Faible"
+                    elif predicted_ift < 2.0:
+                        risk_level = "Modéré"
+                    else:
+                        risk_level = "Élevé"
+                    predictions.append({
+                        'plot_name': plot,
+                        'year': target_year,
+                        'predicted_ift': predicted_ift,
+                        'risk_level': risk_level,
+                        'recent_crops': ', '.join(plot_data['crop_type'].tail(3).unique()),
+                        'historical_avg_ift': plot_data['ift_herbicide'].mean()
+                    })
+        return pd.DataFrame(predictions)
+# Initialize analyzer
+analyzer = WeedPressureAnalyzer()
+def analyze_herbicide_trends(years_range, plot_filter):
+    """Analyze herbicide usage trends over time."""
+    try:
+        if len(years_range) == 2:
+            years = list(range(int(years_range[0]), int(years_range[1]) + 1))
+        else:
+            years = [int(y) for y in years_range]
+        ift_data = analyzer.calculate_herbicide_ift(years=years)
+        if len(ift_data) == 0:
+            return None, "Aucune donnée d'herbicides trouvée."
+        if plot_filter != "Toutes":
+            ift_data = ift_data[ift_data['plot_name'] == plot_filter]
+        fig = px.line(ift_data,
+                     x='year',
+                     y='ift_herbicide',
+                     color='plot_name',
+                     title="Évolution de l'IFT Herbicides",
+                     labels={'ift_herbicide': 'IFT Herbicides', 'year': 'Année'})
+        summary = f"""
+📊 **Analyse de l'IFT Herbicides**
+**Statistiques:**
+- IFT moyen: {ift_data['ift_herbicide'].mean():.2f}
+- IFT maximum: {ift_data['ift_herbicide'].max():.2f}
+- Nombre de parcelles: {ift_data['plot_name'].nunique()}
+**Interprétation:**
+- IFT < 1.0: Pression faible ✅
+- IFT 1.0-2.0: Pression modérée ⚠️
+- IFT > 2.0: Pression élevée ❌
+        """
+        return fig, summary
+    except Exception as e:
+        return None, f"Erreur: {str(e)}"
+def predict_future_weed_pressure():
+    """Predict weed pressure for the next 3 years."""
+    try:
+        predictions = analyzer.predict_weed_pressure()
+        if len(predictions) == 0:
+            return None, "Impossible de générer des prédictions."
+        fig = px.bar(predictions,
+                    x='plot_name',
+                    y='predicted_ift',
+                    color='risk_level',
+                    facet_col='year',
+                    title='Prédiction Pression Adventices (2025-2027)',
+                    color_discrete_map={'Faible': 'green', 'Modéré': 'orange', 'Élevé': 'red'})
+        low_risk = len(predictions[predictions['risk_level'] == 'Faible'])
+        moderate_risk = len(predictions[predictions['risk_level'] == 'Modéré'])
+        high_risk = len(predictions[predictions['risk_level'] == 'Élevé'])
+        summary = f"""
+🔮 **Prédictions 2025-2027**
+**Répartition des risques:**
+- ✅ Risque faible: {low_risk} prédictions
+- ⚠️ Risque modéré: {moderate_risk} prédictions
+- ❌ Risque élevé: {high_risk} prédictions
+        """
+        return fig, summary
+    except Exception as e:
+        return None, f"Erreur: {str(e)}"
+def recommend_sensitive_crop_plots():
+    """Recommend plots for sensitive crops."""
     try:
+        predictions = analyzer.predict_weed_pressure()
+        if len(predictions) == 0:
+            return None, "Aucune recommandation disponible."
+        suitable_plots = predictions[predictions['risk_level'] == "Faible"].copy()
+        if len(suitable_plots) > 0:
+            suitable_plots['recommendation_score'] = 100 - (suitable_plots['predicted_ift'] * 30)
+            suitable_plots = suitable_plots.sort_values('recommendation_score', ascending=False)
+            top_recommendations = suitable_plots.head(10)[['plot_name', 'year', 'predicted_ift', 'recommendation_score']]
+            summary = f"""
+🌱 **Recommandations Cultures Sensibles**
+**Top parcelles recommandées:**
+{top_recommendations.to_string(index=False)}
+**Critères:** IFT prédit < 1.0 (faible pression adventices)
+            """
+            fig = px.scatter(suitable_plots,
+                           x='predicted_ift',
+                           y='recommendation_score',
+                           color='year',
+                           hover_data=['plot_name'],
+                           title='Parcelles Recommandées pour Cultures Sensibles')
+            return fig, summary
         else:
+            return None, "Aucune parcelle à faible risque identifiée."
     except Exception as e:
+        return None, f"Erreur: {str(e)}"
+def generate_technical_alternatives(herbicide_family):
+    """Generate technical alternatives."""
+    summary = f"""
+🔄 **Alternatives aux {herbicide_family}**
+**🚜 Alternatives Mécaniques:**
+• Faux-semis répétés avant implantation
+• Binage mécanique en inter-rang
+• Herse étrille en post-levée précoce
+**🌾 Alternatives Culturales:**
+• Rotation longue avec prairie temporaire
+• Cultures intermédiaires piège à nitrates
+• Densité de semis optimisée
+**🧪 Alternatives Biologiques:**
+• Stimulateurs de défenses naturelles
+• Extraits végétaux (huiles essentielles)
+• Bioherbicides à base de champignons
+**📋 Plan d'Action:**
+1. Tester sur petites surfaces
+2. Former les équipes
+3. Suivre l'efficacité
+4. Documenter les résultats
+    """
+    return summary
+def get_available_plots():
+    """Get available plots."""
     try:
+        plots = analyzer.data_loader.get_plots_available()
+        return ["Toutes"] + plots
+    except Exception:
+        return ["Toutes"]
+# Create Gradio Interface
+def create_mcp_interface():
+    with gr.Blocks(title="🚜 Analyse Pression Adventices", theme=gr.themes.Soft()) as demo:
+        gr.Markdown("""
+        # 🚜 Analyse Pression Adventices - CRA Bretagne
+        Anticiper et réduire la pression des adventices pour optimiser les cultures sensibles (pois, haricot).
+        """)
+        with gr.Tabs():
+            with gr.Tab("📈 Analyse Tendances"):
+                with gr.Row():
+                    years_slider = gr.Slider(2014, 2024, value=[2020, 2024], step=1, label="Période")
+                    plot_dropdown = gr.Dropdown(choices=get_available_plots(), value="Toutes", label="Parcelle")
+                analyze_btn = gr.Button("🔍 Analyser", variant="primary")
+                with gr.Row():
+                    trends_plot = gr.Plot()
+                    trends_summary = gr.Markdown()
+                analyze_btn.click(analyze_herbicide_trends, [years_slider, plot_dropdown], [trends_plot, trends_summary])
+            with gr.Tab("🔮 Prédictions"):
+                predict_btn = gr.Button("🎯 Prédire 2025-2027", variant="primary")
+                with gr.Row():
+                    predictions_plot = gr.Plot()
+                    predictions_summary = gr.Markdown()
+                predict_btn.click(predict_future_weed_pressure, outputs=[predictions_plot, predictions_summary])
+            with gr.Tab("🌱 Recommandations"):
+                recommend_btn = gr.Button("🎯 Recommander Parcelles", variant="primary")
+                with gr.Row():
+                    recommendations_plot = gr.Plot()
+                    recommendations_summary = gr.Markdown()
+                recommend_btn.click(recommend_sensitive_crop_plots, outputs=[recommendations_plot, recommendations_summary])
+            with gr.Tab("🔄 Alternatives"):
+                herbicide_type = gr.Dropdown(["Herbicides", "Fongicides"], value="Herbicides", label="Type")
+                alternatives_btn = gr.Button("💡 Générer Alternatives", variant="primary")
+                alternatives_output = gr.Markdown()
+                alternatives_btn.click(generate_technical_alternatives, [herbicide_type], [alternatives_output])
+    return demo
 if __name__ == "__main__":
+    demo = create_mcp_interface()
+    demo.launch(mcp_server=True, server_name="0.0.0.0", server_port=7860, share=True)
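
Review note: `predict_weed_pressure` fits a straight line to each plot's yearly herbicide IFT, extrapolates it to the target years, clamps the result at zero, and buckets it at the 1.0 and 2.0 thresholds. A minimal, dependency-free sketch of that logic follows. The names `fit_line`, `classify_risk` and `project_ift` and the sample values are illustrative assumptions, not part of the commit, and the closed-form least-squares fit stands in for `np.polyfit`.

```python
def fit_line(xs, ys):
    """Ordinary least-squares line through (xs, ys); returns (slope, intercept)."""
    n = len(xs)
    if n < 2:
        raise ValueError("need at least two points to fit a trend")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def classify_risk(ift):
    """Same thresholds as the diff: < 1.0 low, < 2.0 moderate, otherwise high."""
    if ift < 1.0:
        return "Faible"
    if ift < 2.0:
        return "Modéré"
    return "Élevé"


def project_ift(years, ift_values, target_years=(2025, 2026, 2027)):
    """Extrapolate the fitted trend to each target year, clamped at zero."""
    slope, intercept = fit_line(years, ift_values)
    rows = []
    for year in target_years:
        predicted = max(0.0, slope * year + intercept)
        rows.append({
            "year": year,
            "predicted_ift": predicted,
            "risk_level": classify_risk(predicted),
        })
    return rows


# Made-up history for one plot: the proxy IFT rises by 0.25 per year.
example = project_ift([2021, 2022, 2023, 2024], [0.5, 0.75, 1.0, 1.25])
```

With only a handful of yearly points per plot, a straight-line extrapolation three years out is a rough indicator at best, so the risk labels are probably better read as a ranking aid than as a forecast.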

requirements.txt CHANGED

@@ -1,10 +1,8 @@
-gradio[mcp]>=4.43
+gradio>=4.0.0
 pandas>=2.0.0
 numpy>=1.24.0
-matplotlib>=3.6.0
-seaborn>=0.12.0
-scikit-learn>=1.3.0
+plotly>=5.0.0
 datasets>=2.14.0
-huggingface_hub>=0.17.0
-openpyxl>=3.1.0
-plotly>=5.15.0
+huggingface_hub>=0.16.0
+matplotlib>=3.7.0
+seaborn>=0.12.0

test_data_sources.py DELETED

@@ -1,190 +0,0 @@
-#!/usr/bin/env python3
-"""
-Test script to demonstrate loading data from both local files and Hugging Face.
-"""
-import warnings
-warnings.filterwarnings('ignore')
-from data_loader import AgriculturalDataLoader
-import os
-def test_local_loading():
-    """Test loading from local files."""
-    print("🔍 TESTING LOCAL FILE LOADING")
-    print("=" * 50)
-    try:
-        # Create loader for local files
-        loader = AgriculturalDataLoader.create_local_loader(
-            data_path="/Users/tracyandre/Downloads/OneDrive_1_9-17-2025"
-        )
-        # Load data
-        df = loader.load_all_files()
-        print(f"✅ Local loading successful!")
-        print(f"📊 Records: {len(df):,}")
-        print(f"📅 Years: {sorted(df['year'].unique())}")
-        print(f"🌱 Crops: {df['crop_type'].nunique()}")
-        print(f"📍 Plots: {df['plot_name'].nunique()}")
-        return True
-    except Exception as e:
-        print(f"❌ Local loading failed: {e}")
-        return False
-def test_hf_loading():
-    """Test loading from Hugging Face."""
-    print("\n🤗 TESTING HUGGING FACE LOADING")
-    print("=" * 50)
-    # Check if HF token is available
-    hf_token = os.environ.get("HF_TOKEN")
-    if not hf_token:
-        print("⚠️  No HF_TOKEN found in environment variables")
-        print("💡 Set HF_TOKEN to test Hugging Face loading")
-        return False
-    try:
-        # Create loader for Hugging Face
-        loader = AgriculturalDataLoader.create_hf_loader(
-            dataset_id="HackathonCRA/2024",
-            hf_token=hf_token
-        )
-        # Load data
-        df = loader.load_from_huggingface()
-        print(f"✅ Hugging Face loading successful!")
-        print(f"📊 Records: {len(df):,}")
-        print(f"📅 Years: {sorted(df['year'].unique())}")
-        print(f"🌱 Crops: {df['crop_type'].nunique()}")
-        print(f"📍 Plots: {df['plot_name'].nunique()}")
-        return True
-    except Exception as e:
-        print(f"❌ Hugging Face loading failed: {e}")
-        print("💡 Make sure the dataset exists and you have access")
-        return False
-def test_auto_fallback():
-    """Test automatic fallback from HF to local files."""
-    print("\n🔄 TESTING AUTO FALLBACK (HF -> LOCAL)")
-    print("=" * 50)
-    try:
-        # Create loader with HF enabled but potentially failing
-        loader = AgriculturalDataLoader(
-            data_path="/Users/tracyandre/Downloads/OneDrive_1_9-17-2025",
-            dataset_id="nonexistent-dataset",  # This should fail
-            use_hf=True
-        )
-        # This should try HF first, then fallback to local
-        df = loader.load_all_files()
-        print(f"✅ Auto fallback successful!")
-        print(f"📊 Records: {len(df):,}")
-        print("🔄 Successfully fell back to local files after HF failure")
-        return True
-    except Exception as e:
-        print(f"❌ Auto fallback failed: {e}")
-        return False
-def test_data_source_switching():
-    """Test switching between data sources."""
-    print("\n🔀 TESTING DATA SOURCE SWITCHING")
-    print("=" * 50)
-    try:
-        # Create loader
-        loader = AgriculturalDataLoader(
-            data_path="/Users/tracyandre/Downloads/OneDrive_1_9-17-2025",
-            dataset_id="HackathonCRA/2024"
-        )
-        # Test local loading
-        loader.set_data_source(use_hf=False)
-        df_local = loader.load_all_files()
-        print(f"📁 Local: {len(df_local):,} records")
-        # Test switching to HF (if token available)
-        if os.environ.get("HF_TOKEN"):
-            loader.set_data_source(use_hf=True)
-            try:
-                df_hf = loader.load_all_files()
-                print(f"🤗 HF: {len(df_hf):,} records")
-                # Compare data
-                if len(df_local) == len(df_hf):
-                    print("✅ Data consistency: Same number of records")
-                else:
-                    print(f"⚠️  Data difference: Local={len(df_local)}, HF={len(df_hf)}")
-            except Exception as e:
-                print(f"🤗 HF loading failed (expected): {e}")
-        else:
-            print("⚠️  No HF_TOKEN - skipping HF test")
-        return True
-    except Exception as e:
-        print(f"❌ Data source switching failed: {e}")
-        return False
-def main():
-    """Run all tests."""
-    print("🚜 AGRICULTURAL DATA LOADER TESTING")
-    print("=" * 60)
-    print()
-    results = []
-    # Test 1: Local loading
-    results.append(("Local Loading", test_local_loading()))
-    # Test 2: Hugging Face loading
-    results.append(("HF Loading", test_hf_loading()))
-    # Test 3: Auto fallback
-    results.append(("Auto Fallback", test_auto_fallback()))
-    # Test 4: Data source switching
-    results.append(("Source Switching", test_data_source_switching()))
-    # Summary
-    print("\n📋 TEST SUMMARY")
-    print("=" * 30)
-    passed = 0
-    for test_name, result in results:
-        status = "✅ PASS" if result else "❌ FAIL"
-        print(f"{test_name:<20} {status}")
-        if result:
-            passed += 1
-    print(f"\n🎯 Results: {passed}/{len(results)} tests passed")
-    if passed == len(results):
-        print("🎉 All tests passed! Data loader is working correctly.")
-    else:
-        print("⚠️  Some tests failed. Check the output above for details.")
-    print("\n💡 Usage Examples:")
-    print("# Load from local files:")
-    print("loader = AgriculturalDataLoader.create_local_loader('/path/to/data')")
-    print()
-    print("# Load from Hugging Face:")
-    print("loader = AgriculturalDataLoader.create_hf_loader('HackathonCRA/2024')")
-    print()
-    print("# Auto-detect with fallback:")
-    print("loader = AgriculturalDataLoader(use_hf=True)")
-    print("df = loader.load_all_files()  # Tries HF first, falls back to local")
-if __name__ == "__main__":
-    main()
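
Review note: the deleted script reports PASS/FAIL by returning booleans and printing a summary. If this coverage is reinstated later, one option is to express the same checks as `unittest` cases, so the runner does the bookkeeping and the token-dependent case is skipped rather than counted as a failure. The sketch below uses a hypothetical `StubLoader` in place of `AgriculturalDataLoader`, whose real constructor and return types are not shown in this diff.

```python
import os
import unittest


class StubLoader:
    """Hypothetical stand-in for AgriculturalDataLoader (not the real class)."""

    def __init__(self, records):
        self._records = records

    def load_all_files(self):
        # The real loader returns a DataFrame; a list of dicts is enough here.
        return list(self._records)


class LoaderChecks(unittest.TestCase):
    def test_records_are_loaded(self):
        loader = StubLoader([
            {"year": 2023, "plot_name": "A"},
            {"year": 2024, "plot_name": "B"},
        ])
        rows = loader.load_all_files()
        self.assertEqual(len(rows), 2)
        self.assertEqual(sorted({r["year"] for r in rows}), [2023, 2024])

    @unittest.skipUnless(os.environ.get("HF_TOKEN"), "HF_TOKEN not set")
    def test_token_is_available(self):
        # Placeholder for a check that would need Hub access.
        self.assertTrue(os.environ["HF_TOKEN"])


def run_checks():
    """Run the suite programmatically and return the unittest result object."""
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(LoaderChecks)
    return unittest.TextTestRunner(verbosity=0).run(suite)
```

Calling `run_checks().wasSuccessful()` then replaces the hand-rolled pass counter in `main()`.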

test_hf_only.py DELETED

@@ -1,155 +0,0 @@
-#!/usr/bin/env python3
-"""
-Test script to validate Hugging Face only loading.
-"""
-import os
-import warnings
-warnings.filterwarnings('ignore')
-def test_hf_only_loading():
-    """Test that the loader only works with Hugging Face."""
-    print("🤗 TESTING HUGGING FACE ONLY LOADING")
-    print("=" * 50)
-    from data_loader import AgriculturalDataLoader
-    # Check if HF token is available
-    hf_token = os.environ.get("HF_TOKEN")
-    if not hf_token:
-        print("⚠️  No HF_TOKEN found in environment variables")
-        print("💡 Set HF_TOKEN to test Hugging Face loading")
-        print("🔧 For this test, we'll try without token (may fail)")
-    try:
-        # Create loader (HF only)
-        loader = AgriculturalDataLoader(
-            dataset_id="HackathonCRA/2024",
-            hf_token=hf_token
-        )
-        print(f"🤗 Attempting to load from dataset: {loader.dataset_id}")
-        # Load data
-        df = loader.load_all_files()
-        print(f"✅ Success! Loaded {len(df):,} records from Hugging Face")
-        print(f"📊 Years: {sorted(df['year'].unique())}")
-        print(f"🌱 Crops: {df['crop_type'].nunique()}")
-        print(f"📍 Plots: {df['plot_name'].nunique()}")
-        print(f"💊 Herbicide applications: {df['is_herbicide'].sum()}")
-        return True
-    except Exception as e:
-        print(f"❌ Failed to load from Hugging Face: {e}")
-        print("💡 This is expected if the dataset doesn't exist yet")
-        print("🔧 Make sure to upload your dataset to HF Hub first")
-        return False
-def test_no_local_fallback():
-    """Test that there's no local fallback."""
-    print("\n🚫 TESTING NO LOCAL FALLBACK")
-    print("=" * 50)
-    from data_loader import AgriculturalDataLoader
-    try:
-        # Create loader with non-existent dataset
-        loader = AgriculturalDataLoader(
-            dataset_id="nonexistent/dataset"
-        )
-        # This should fail without falling back to local
-        df = loader.load_all_files()
-        print(f"❌ Unexpected success - loaded {len(df)} records")
-        print("⚠️  This suggests local fallback is still active")
-        return False
-    except Exception as e:
-        print(f"✅ Expected failure: {e}")
-        print("✅ Confirmed: No local fallback, HF only")
-        return True
-def test_simple_usage():
-    """Test simple usage pattern."""
-    print("\n📝 SIMPLE USAGE EXAMPLE")
-    print("=" * 50)
-    print("💡 Recommended usage pattern:")
-    print()
-    usage_code = '''
-from data_loader import AgriculturalDataLoader
-# Simple HF-only loader
-loader = AgriculturalDataLoader(dataset_id="HackathonCRA/2024")
-# Load data (will use HF_TOKEN from environment)
-df = loader.load_all_files()
-# Analyze data
-print(f"Loaded {len(df)} records from Hugging Face")
-'''
-    print(usage_code)
-    try:
-        from data_loader import AgriculturalDataLoader
-        loader = AgriculturalDataLoader(dataset_id="HackathonCRA/2024")
-        print("✅ Loader created successfully")
-        print(f"🎯 Target dataset: {loader.dataset_id}")
-        print(f"🔑 Using token: {'Yes' if loader.hf_token else 'No (from env)'}")
-        return True
-    except Exception as e:
-        print(f"❌ Failed to create loader: {e}")
-        return False
-def main():
-    """Run all tests."""
-    print("🚜 HUGGING FACE ONLY - VALIDATION TESTS")
-    print("=" * 60)
-    print()
-    results = []
-    # Test 1: HF loading
-    results.append(("HF Only Loading", test_hf_only_loading()))
-    # Test 2: No local fallback
-    results.append(("No Local Fallback", test_no_local_fallback()))
-    # Test 3: Simple usage
-    results.append(("Simple Usage", test_simple_usage()))
-    # Summary
-    print("\n📋 TEST SUMMARY")
-    print("=" * 30)
-    passed = 0
-    for test_name, result in results:
-        status = "✅ PASS" if result else "❌ FAIL"
-        print(f"{test_name:<20} {status}")
-        if result:
-            passed += 1
-    print(f"\n🎯 Results: {passed}/{len(results)} tests passed")
-    if passed >= 2:  # Allow HF loading to fail if dataset doesn't exist
-        print("🎉 Validation successful! Loader is HF-only.")
-    else:
-        print("⚠️  Validation issues detected.")
-    print("\n🚀 DEPLOYMENT CHECKLIST:")
-    print("✅ Remove local file dependencies")
-    print("✅ HF-only data loading")
-    print("✅ No fallback mechanisms")
-    print("🔲 Upload dataset to HF Hub")
-    print("🔲 Set HF_TOKEN in production")
-    print("🔲 Test with real HF dataset")
-if __name__ == "__main__":
-    main()
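
Review note: `test_no_local_fallback` treats any exception as the passing outcome, so an unrelated error in the loader's constructor would also count as "no fallback". A narrower version of the check catches only the error that signals a missing dataset. The sketch below is illustrative: `DatasetUnavailable`, `HubOnlyLoader` and `fails_without_fallback` are hypothetical names, and the dictionary stands in for the Hub.

```python
class DatasetUnavailable(Exception):
    """Hypothetical error type for a dataset that cannot be fetched."""


class HubOnlyLoader:
    """Hypothetical loader with no local fallback (illustration only)."""

    def __init__(self, dataset_id, available):
        self.dataset_id = dataset_id
        self._available = available  # dataset ids the fake "Hub" knows about

    def load_all_files(self):
        if self.dataset_id not in self._available:
            raise DatasetUnavailable(self.dataset_id)
        return self._available[self.dataset_id]


def fails_without_fallback(loader):
    """Return True only if loading raises the expected error."""
    try:
        loader.load_all_files()
    except DatasetUnavailable:
        return True
    # Loading succeeded, so some data source answered for this dataset id.
    return False


fake_hub = {"HackathonCRA/2024": [{"year": 2024}]}
missing = fails_without_fallback(HubOnlyLoader("nonexistent/dataset", fake_hub))
present = fails_without_fallback(HubOnlyLoader("HackathonCRA/2024", fake_hub))
```

Any other exception type propagates to the caller instead of being reported as a pass.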