Interview request: genAI evaluation & documentation
#61 opened about 1 year ago
		by
		
				
 meggymuggy
							
						meggymuggy
	
language dependency
#60 opened over 1 year ago
		by
		
				
 Jay369
							
						Jay369
	
[AUTOMATED] Model Memory Requirements
#59 opened over 1 year ago
		by
		
				
 model-sizer-bot
							
						model-sizer-bot
	
Deployments to Azure and Inference Endpoints
#55 opened over 1 year ago
		by
		
				
 mo2024
							
						mo2024
	
Very sensitve to any repetition penalty!
π
							
						1
				#52 opened over 1 year ago
		by
		
				
 jukofyork
							
						jukofyork
	
 
							Text2SQL2Output
#51 opened over 1 year ago
		by
		
				
 Sudipta179002
							
						Sudipta179002
	
The generated response cannot stop.
									1
	#50 opened over 1 year ago
		by
		
				
 shaohuay
							
						shaohuay
	
Saving dbrx model and tokenizer in dbfs
									5
	#49 opened over 1 year ago
		by
		
				
 pro-shep
							
						pro-shep
	
 
							OSError: Unable to load vocabulary from file
									7
	#47 opened over 1 year ago
		by
		
				
 khurramnaseem
							
						khurramnaseem
	
TypeError: __init__() got an unexpected keyword argument 'bias'
									2
	#46 opened over 1 year ago
		by
		
				
 dainesn1
							
						dainesn1
	
[DO NOT REVIEW] Mixtral like config
#45 opened over 1 year ago
		by
		
				
 Pernekhan
							
						Pernekhan
	
Why clamp qkv_states, is it common?
#44 opened over 1 year ago
		by
		
				
 jay68
							
						jay68
	
Chat template
									9
	#43 opened over 1 year ago
		by
		
				
 ehartford
							
						ehartford
	
 
							GGUF quants?
									1
	#41 opened over 1 year ago
		by
		
				
 Iommed
							
						Iommed
	
Does the tokenizer of this model have a network to load successfully?
									3
	#40 opened over 1 year ago
		by
		
				
 Rnake
							
						Rnake
	
VRAM Requirements?
									8
	#39 opened over 1 year ago
		by
		
				
 dounykim
							
						dounykim
	
How to get hands on experience as a newbie
									1
	#38 opened over 1 year ago
		by
		
				
 kimsia
							
						kimsia
	
Text2sql template and examples
									3
	#34 opened over 1 year ago
		by
		
				
 daxiongshu
							
						daxiongshu
	
Continuation of the Discussion: More than 10 minutes the status is in Setting `pad_token_id` to `eos_token_id`:100257 for open-end generation. #28
β
							
						2
				
									7
	#31 opened over 1 year ago
		by
		
				
 Madhugraj
							
						Madhugraj
	
Errors During Training for the Original Implementation and the Fixes for the Errors
π
							
						2
				
									2
	#24 opened over 1 year ago
		by
		
				
 v2ray
							
						v2ray
	
 
							Instruct dataset
π
							
						2
				#23 opened over 1 year ago
		by
		
				
 Andriy
							
						Andriy
	
How to Fine Tune DBRX-Instruct?
									7
	#18 opened over 1 year ago
		by
		
				
 elysiia
							
						elysiia
	
 
							Bug on AMD MI 250 with flash-attention
								3
#13 opened over 1 year ago
		by
		
				
 PierreColombo
							
						PierreColombo
	
The fused expert parameters means load_in_4bit doesn't work properly, nor does LoRA
π§ 
							π
							
						7
				
								31
#10 opened over 1 year ago
		by
		
				
 tdrussell
							
						tdrussell
	
