31 4 29

Raymond Ng

RaymondAISG

AI & ML interests

Foundation Model; Natural Language Processing; Deep Learning;

Recent Activity

upvoted an article about 18 hours ago

BioClinical ModernBERT: an example of continued pre-training of ModernBERT

new activity 7 months ago

common-pile/foodista_filtered:ArrowInvalid Exception: Failed to parse string: '' as a scalar of type timestamp[s]

liked a Space 10 months ago

nanotron/ultrascale-playbook

View all activity

Organizations

upvoted an article about 18 hours ago

Article

BioClinical ModernBERT: an example of continued pre-training of ModernBERT

Sep 10

•

New activity in common-pile/foodista_filtered 7 months ago

ArrowInvalid Exception: Failed to parse string: '' as a scalar of type timestamp[s]

#2 opened 7 months ago by

RaymondAISG

liked a Space 10 months ago

The Ultra-Scale Playbook

🌌

3.6k

The ultimate guide to training LLM on large GPU Clusters

New activity in aisingapore/Llama-SEA-LION-v3-70B-IT 11 months ago

Instruction and answer must be in the same language?

#1 opened 11 months ago by

rub2000

New activity in aisingapore/Gemma-SEA-LION-v3-9B-IT 11 months ago

Update config.json so that it can be run by llm serving engine

#3 opened 11 months ago by

Pinkgu1

Update config.json

#2 opened 11 months ago by

Pinkgu1

updated a collection 11 months ago

RLMs

Collection

2 items • Updated Jan 24

upvoted a paper 11 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 431

authored a paper about 1 year ago

Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation

Paper • 2412.03304 • Published Dec 4, 2024 • 19

New activity in aisingapore/SEA-LION-v1-7B-IT about 1 year ago

Model doesn't run under HF's Transformers / Inference Endpoints

#9 opened about 1 year ago by

gtie

New activity in aisingapore/SEA-LION-v1-7B over 1 year ago

Is SEA-LION trained on Singaporean culture?

#13 opened over 1 year ago by

SBSTFRNNDZ

updated 4 models over 1 year ago

liked 3 models over 1 year ago

aisingapore/Llama-SEA-LION-v2-8B

Text Generation • 8B • Updated Apr 15 • 92 • 4

aisingapore/Llama-SEA-LION-v2-8B-IT

Text Generation • 8B • Updated Apr 15 • 602 • • 17

aisingapore/SEA-LION-v1-7B-IT

Text Generation • 8B • Updated Apr 14 • 857 • 24

New activity in aisingapore/SEA-LION-v1-7B over 1 year ago

python llama.cpp/convert-hf-to-gguf.py ~/sea-liton-7b/ error

#10 opened almost 2 years ago by

pacozaa

New activity in aisingapore/SEA-LION-v1-7B-IT over 1 year ago

System Prompt

#5 opened over 1 year ago by

anhnh2002

Raymond Ng

AI & ML interests

Recent Activity

Organizations

RaymondAISG's activity

BioClinical ModernBERT: an example of continued pre-training of ModernBERT

ArrowInvalid Exception: Failed to parse string: '' as a scalar of type timestamp[s]

The Ultra-Scale Playbook

Instruction and answer must be in the same language?

Update config.json so that it can be run by llm serving engine

Update config.json

Model doesn't run under HF's Transformers / Inference Endpoints

Is SEA-LION trained on Singaporean culture?

python llama.cpp/convert-hf-to-gguf.py ~/sea-liton-7b/ error

System Prompt