Spaces: CoreyMorris/MMLU-by-task-Leaderboard
MMLU-by-task-Leaderboard (1.78 MB, 4 contributors)
History: 168 commits
Corey Morris: Loading new csv with updated data (a0c39f5, about 2 years ago)
Name                             Size       Last updated       Last commit message
.github                          -          over 2 years ago   added a test and removed the code to only test a specific file because that code did not work
.gitattributes                   1.52 kB    over 2 years ago   initial commit
.gitignore                       68 Bytes   over 2 years ago   updated gitignore
.gitmodules                      106 Bytes  over 2 years ago   added hugging face evaluation harness results submodule
README.md                        248 Bytes  over 2 years ago   initial commit
app.py                           15.9 kB    about 2 years ago  Loading new csv with updated data
contaminated_models.csv          117 Bytes  over 2 years ago   Updated contaminated models
contaminated_models.txt          65 Bytes   over 2 years ago   Updated contaminated models
details_data_processor.py        4.04 kB    over 2 years ago   updated pipeline and init
dev_requirements.txt             252 Bytes  over 2 years ago   updated dev requirements
moral_app.py                     11.1 kB    over 2 years ago   Extracted plotting functions from moral_app to plotting_utils to improve organization and testability
moral_scenarios_questions.csv    370 kB     over 2 years ago   Show a random question from the moral scenarios evaluation
plotting_utils.py                4.42 kB    over 2 years ago   Extracted plotting functions from moral_app to plotting_utils to improve organization and testability
requirements.txt                 156 Bytes  over 2 years ago   Updated dependencies
result_data.csv                  1.35 MB    about 2 years ago  updated
result_data_processor.py         6.97 kB    about 2 years ago  moved organization column to the front
save_for_regression.py           1.86 kB    over 2 years ago   changed to save and load in a directory
split_question.py                964 Bytes  over 2 years ago   added code to split moral scenario question from one question to two
test_details_data_processing.py  4.33 kB    over 2 years ago   added a test
test_integration.py              1.96 kB    over 2 years ago   fixed test_streamlit_app_runs
test_paths.py                    780 Bytes  over 2 years ago   added a test and removed the code to only test a specific file because that code did not work
test_regression.py               1.26 kB    over 2 years ago   added todo for test
test_result_data_processing.py   1.66 kB    over 2 years ago   Added organization to dataframe