Spaces:
Runtime error
Runtime error
add resources
Browse files- resources/embedding_model_scores.csv +11 -0
- resources/flowchart.png +0 -0
- resources/langsmith_walkthrough.mp4 +3 -0
- resources/link-to-flowchart.txt +1 -0
- resources/llm_scores.csv +94 -0
- resources/search-limit-chinese.png +0 -0
- resources/test_llm_standalone_chat.txt +21 -0
- resources/vectordatabase_evaluation.csv +6 -0
resources/embedding_model_scores.csv
ADDED
|
@@ -0,0 +1,11 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
Model name,Provider,Model size (#pamras),Model Size (disk),Download past month,Highlights,Time Load/Inference (online compute),Mean difference paired & unpaired Q & Docs
|
| 2 |
+
intfloat/multilingual-e5-large,Microsoft,560M,2.2G,93K,24 layers and the embedding size is 1024,5.0s/1920s,0.062
|
| 3 |
+
intfloat/multilingual-e5-base,Microsoft,278M,1.1G,42K,12 layers and the embedding size is 768,3.4s/531s,0.063
|
| 4 |
+
sentence-transformers/LaBSE,Google,,1.9G,88K,the embedding size is 768,5.7s/620s,0.19
|
| 5 |
+
maidalun1020/bce-embedding-base_v1,NetEase-Youdao,279M,1.1G,111K,optimized for RAG,3.0s/495s,0.23
|
| 6 |
+
BAAI/bge-large-zh-v1.5,Beijing Academy of Artificial Intelligence,326M,1.3G,22K,,1.6s/1730s,0.26
|
| 7 |
+
uer/sbert-base-chinese-nli,Tencent,,409M,8K,12 layers and the embedding size is 768,0.6s/1350s,0.22
|
| 8 |
+
sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2,Sentence Transformer,,449M,38K,384 embedding size,1.4s/392s,0.25
|
| 9 |
+
sentence-transformers/distiluse-base-multilingual-cased-v1,Sentence Transformer,,539M,31K,768 embedding size,1.3s/163s,0.28
|
| 10 |
+
sentence-transformers/distiluse-base-multilingual-cased-v2,Sentence Transformer,,539M,43K,768 enbedding size,1.2s/164s,0.25
|
| 11 |
+
sentence-transformers/paraphrase-multilingual-mpnet-base-v2,Sentence Transformer,,1.1G,24K,768 embedding size,2.7s/463s,0.21
|
resources/flowchart.png
ADDED
|
resources/langsmith_walkthrough.mp4
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:463ca06f3e980b75c325e5cc93656a7cc7885906ce1a1eb95e6e6c230fae5ce2
|
| 3 |
+
size 9444302
|
resources/link-to-flowchart.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
https://lucid.app/lucidchart/286fb57c-e78a-4723-857b-cbd92252af8a/edit
|
resources/llm_scores.csv
ADDED
|
@@ -0,0 +1,94 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
,baichuan-inc/Baichuan-7B,hfl/chinese-alpaca-2-7b,Qwen/Qwen-7B-Chat,Qwen/Qwen1.5-7B-Chat,HuggingFaceH4/zephyr-7b-beta,01-ai/Yi-6B-Chat,BAAI/AquilaChat2-7B-16K
|
| 2 |
+
"Overall (lower the better)",17,15,11,10,14,24,Generate nonsense
|
| 3 |
+
Ready to use,"2
|
| 4 |
+
|
| 5 |
+
Some special tokens returned but easy to clean up","1
|
| 6 |
+
|
| 7 |
+
No special tokens in responses","2
|
| 8 |
+
|
| 9 |
+
Some special tokens returned but easy to clean up","1
|
| 10 |
+
|
| 11 |
+
Robust even without system prompt","1
|
| 12 |
+
|
| 13 |
+
No special tokens in responses","3
|
| 14 |
+
|
| 15 |
+
Returning many unrelated texts, indicating post-processing requirements",NA
|
| 16 |
+
Instruction following - general,"2
|
| 17 |
+
|
| 18 |
+
Having problem distinguish the poem types","2
|
| 19 |
+
|
| 20 |
+
Having problem distinguish the poem types","1
|
| 21 |
+
|
| 22 |
+
Perfectly follows","2
|
| 23 |
+
|
| 24 |
+
almost follows except one case with Chinese-only instruction","2
|
| 25 |
+
|
| 26 |
+
Perfectly follows if ignoring language requirements on poems","5
|
| 27 |
+
|
| 28 |
+
Occasionally not answering questions at all",NA
|
| 29 |
+
Instruction Following - language,"3
|
| 30 |
+
|
| 31 |
+
Always answers in Chinese","3
|
| 32 |
+
|
| 33 |
+
Always answers in Chinese","1
|
| 34 |
+
|
| 35 |
+
Perfectly distinguishing output language requirements","2
|
| 36 |
+
|
| 37 |
+
almost follows except one case with Chinese-only instruction","2
|
| 38 |
+
|
| 39 |
+
Having problem with citing Chinese poems","1
|
| 40 |
+
|
| 41 |
+
Perfectly distinguishing output language requirements",NA
|
| 42 |
+
Helpfulness and Creativeness,"1
|
| 43 |
+
|
| 44 |
+
Answer questions with helpful contexts","1
|
| 45 |
+
|
| 46 |
+
Answer questions with helpful contexts","2
|
| 47 |
+
|
| 48 |
+
Very concise, sometimes too concise","1
|
| 49 |
+
|
| 50 |
+
Answer questions with helpful contexts","1
|
| 51 |
+
|
| 52 |
+
Answer questions with helpful contexts","3
|
| 53 |
+
|
| 54 |
+
Too verbose",NA
|
| 55 |
+
Fact,"2
|
| 56 |
+
|
| 57 |
+
Wrong answers for citing poem","2
|
| 58 |
+
|
| 59 |
+
Wrong answers for citing poem","1
|
| 60 |
+
|
| 61 |
+
No obvious mistakes","1
|
| 62 |
+
|
| 63 |
+
No obvious mistakes","2
|
| 64 |
+
|
| 65 |
+
Wrong answers for citing poem","3
|
| 66 |
+
|
| 67 |
+
Wrong answers for citing poem and country terriory",NA
|
| 68 |
+
Reasoning,"2
|
| 69 |
+
|
| 70 |
+
Self-consistent in reasoning but factually wrong","2
|
| 71 |
+
|
| 72 |
+
Self-consistent in reasoning but factually wrong","2
|
| 73 |
+
|
| 74 |
+
Self-consistent in reasoning but factually wrong","1
|
| 75 |
+
|
| 76 |
+
Self-consistent and factually correct","2
|
| 77 |
+
|
| 78 |
+
Self-consistent in reasoning but factually wrong","2
|
| 79 |
+
|
| 80 |
+
Self-consistent in reasoning but factually wrong",NA
|
| 81 |
+
Coding,"3
|
| 82 |
+
|
| 83 |
+
Valid code but wrong formatting or explanation","3
|
| 84 |
+
|
| 85 |
+
Valid code but wrong formatting or explanation","1
|
| 86 |
+
|
| 87 |
+
Perfect codes with explanation","1
|
| 88 |
+
|
| 89 |
+
Perfect codes with explanation","1
|
| 90 |
+
|
| 91 |
+
Perfect codes with explanation","4
|
| 92 |
+
|
| 93 |
+
Nonsense",NA
|
| 94 |
+
Inference Speed,2,1,1,1,3,3,3
|
resources/search-limit-chinese.png
ADDED
|
resources/test_llm_standalone_chat.txt
ADDED
|
@@ -0,0 +1,21 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
你是谁?
|
| 2 |
+
|
| 3 |
+
李白是谁?
|
| 4 |
+
|
| 5 |
+
请说出李白写过的三首诗的名字。
|
| 6 |
+
|
| 7 |
+
请全文背诵第二首诗。
|
| 8 |
+
|
| 9 |
+
李白和杜甫认识吗?请展示你的思考过程并陈述结论。
|
| 10 |
+
|
| 11 |
+
忘记前面的对话。告诉我到底莎士比亚的作品到底是哈姆雷特还是哈姆莱特?
|
| 12 |
+
|
| 13 |
+
请以莎士比亚为主题写一首古体诗,要求是七言绝句。
|
| 14 |
+
|
| 15 |
+
请以莎士比亚为主题写一首现代诗,不超过150字。
|
| 16 |
+
|
| 17 |
+
who created you?
|
| 18 |
+
|
| 19 |
+
Name the top 3 countries in the world based on how big they are.
|
| 20 |
+
|
| 21 |
+
I want to find the day of the week for the current date. Please write code in Python to fulfill such requirement.
|
resources/vectordatabase_evaluation.csv
ADDED
|
@@ -0,0 +1,6 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
,Redis,OpenSearch,Chroma,FAISS,Qdrant,Superbase,Pinecone
|
| 2 |
+
Offline/Local mode,Y,Y,Y,Y,Y,N,N
|
| 3 |
+
Serverless,N,N (requires docker),Y,Y,Y,N,Y
|
| 4 |
+
Offload to In-disk memory,Y,Y,Y,Y,N (can’t reload),Y,N
|
| 5 |
+
Support self-query,Y,Y,Y,N,Y,Y,Y
|
| 6 |
+
Support fuzzy match,"CONTAIN, LIKE","CONTAIN, LIKE",N,N,LIKE,LIKE,IN
|