Update collated assets CSV.

stanford-crfm · Dec 4, 2024 · 2180237
1 parent 206daeb
Showing 1 changed file with 0 additions and 1 deletion.
diff --git a/resources/all_assets.csv b/resources/all_assets.csv
@@ -3,7 +3,6 @@ model,GodziLLa 2,Maya Philippines,"GodziLLa 2 is an experimental combination of
 dataset,AutoMathText,Math AI,AutoMathText is an extensive and carefully curated dataset encompassing around 200 GB of mathematical texts.,2024-02-12,https://github.com/yifanzhang-pro/AutoMathText,,text,Mistral model fine-tuned on AutoMathText and evaluated on the MATH dataset.,200 GB,"['OpenWebMath', 'RedPajama-Data', 'Algebraic Stack', 'Qwen']",,,,,open,CC BY-SA 4.0,,,unknown,https://huggingface.co/datasets/math-ai/AutoMathText/discussions,https://huggingface.co/datasets/math-ai/AutoMathText,[],,,,,,,,
 model,Yi,01 AI,The Yi series models are large language models trained from scratch by developers at 01 AI.,2023-11-02,https://github.com/01-ai/Yi,https://huggingface.co/01-ai/Yi-34B,text; text,"Evaluated on standard language benchmarks, common sense reasoning, and reading comprehension in comparison to SoTA LLMs.",34B parameters (dense),[],unknown,unknown,unknown,"Model underwent supervised fine-tuning, leading to a greater diversity of responses.",open,custom,,,unknown,https://huggingface.co/01-ai/Yi-34B/discussions,,,,,,,,,,
 model,Yi-VL,01 AI,"The Yi Vision Language (Yi-VL) model is the open-source, multimodal version of the Yi Large Language Model (LLM) series, enabling content comprehension, recognition, and multi-round conversations about images.",2024-01-23,https://github.com/01-ai/Yi,https://huggingface.co/01-ai/Yi-VL-34B,text; text,"Yi-VL outperforms all existing open-source models in MMMU and CMMMU, two advanced benchmarks that include massive multi-discipline multimodal questions (based on data available up to January 2024).",34B parameters (dense),[],unknown,10 days,128 NVIDIA A800 (80G) GPUs,unknown,open,custom,,,unknown,https://huggingface.co/01-ai/Yi-VL-34B/discussions,,,,,,,,,,
-model,Chai-1,Chai Discovery Team,"Chai-1 is a multi-modal foundation model for molecular structure prediction that performs at the state-of-the-art across a variety of tasks relevant to drug discovery. It enables unified prediction of proteins, small molecules, DNA, RNA, covalent modifications, and more. It tested with a 77% success rate on the PoseBusters benchmark and an Cα LDDT of 0.849 on the CASP15 protein monomer structure prediction set.",2024-09-09,https://www.chaidiscovery.com/blog/introducting-chai-1,,Unknown,"The model was tested across a large number of benchmarks and found to achieve a 77% success rate on the PoseBusters benchmark (vs. 76% by AlphaFold3), as well as an Cα LDDT of 0.849 on the CASP15 protein monomer structure prediction set (vs. 0.801 by ESM3-98B). The model can also run in single sequence mode without MSAs while preserving most of its performance.",Unknown,Unknown,Unknown,Unknown,Unknown,Unknown,The model is available for free via a web interface for commercial applications such as drug discovery and the model weights and inference code are also released as a software library for non-commercial use.,Unknown,Can be used for drug discovery and other applications that require molecular structure prediction.,Unknown,Unknown,Unknown,,,,,,,,,,
 model,Midm,KT Corporation,Midm is a pre-trained Korean-English language model developed by KT. It takes text as input and creates text. The model is based on Transformer architecture for an auto-regressive language model.,2023-10-31,https://huggingface.co/KT-AI/midm-bitext-S-7B-inst-v1,https://huggingface.co/KT-AI/midm-bitext-S-7B-inst-v1,text; text,unknown,7B parameters,"['AI-HUB dataset', 'National Institute of Korean Language dataset']",unknown,unknown,unknown,"KT tried to remove unethical expressions such as profanity, slang, prejudice, and discrimination from training data.",open,CC-BY-NC 4.0,It is expected to be used for various research purposes.,It cannot be used for commercial purposes.,unknown,https://huggingface.co/KT-AI/midm-bitext-S-7B-inst-v1/discussions,,,,,,,,,,
 model,ACT-1,Adept,ACT-1 (ACtion Transformer) is a large-scale transformer model designed and trained specifically for taking actions on computers (use software tools APIs and websites) in response to the user's natural language commands.,2022-09-14,https://www.adept.ai/blog/act-1,,text; text,,,[],unknown,unknown,unknown,,closed,unknown,,,,,,,,,,,,,,
 model,Persimmon,Adept,"Persimmon is the most capable open-source, fully permissive model with fewer than 10 billion parameters, as of its release date.",2023-09-07,https://www.adept.ai/blog/persimmon-8b,,text; text,"Evaluated in comparison to LLaMA 2 and MPT Instruct, and outperforms both on standard benchmarks.",8B parameters (dense),[],,,,,open,Apache 2.0,,,,,,,,,,,,,,