llama2.c

mirror of https://github.com/trholding/llama2.c.git synced 2026-02-06 03:16:50 +00:00

Author	SHA1	Message	Date
David A Roberts	52fe3653e5	Fix vocab_source in sample.py	2023-08-18 18:40:25 +10:00
Andrej Karpathy	fe2de68688	fix sample.py from tokenizer changes before	2023-08-15 02:33:01 +00:00
Andrej Karpathy	82ad2ba34e	remove tiktoken as dependency	2023-08-14 05:53:57 +00:00
Andrej Karpathy	58075b5ac5	update API of sample.py to be better, small changes here	2023-08-13 20:31:32 +00:00
Andrej Karpathy	38bfac90a8	bigchange: add multiquery support in run.c. we can now train and inference multiquery models (where n_kv_heads < n_heads). this also means that we, in principle, support Llama 2 34B and 70B models, which are multiquery	2023-08-13 19:34:05 +00:00
Andrej Karpathy	b0cfa2458d	ok i can train and sample a model with a custom tokenizer	2023-08-11 16:47:29 +00:00
Daniel Gross	518524f458	default to whatever system has	2023-07-23 10:41:03 -07:00
Andrej Karpathy	9414e7a45e	tweaks and add a simple test	2023-07-23 14:52:08 +00:00
Andrej Karpathy	5b161abb9a	somewhere ~20 hours later	2023-07-23 05:23:45 +00:00