Skip to content

Commit a1ba367

Browse files
committed
Add Guru.yaml
1 parent 5f7bef4 commit a1ba367

File tree

1 file changed

+105
-0
lines changed

1 file changed

+105
-0
lines changed

Guru.yaml

Lines changed: 105 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,105 @@
1+
---
2+
# Thank you for contributing!
3+
# In filling out this yaml file, please follow the criteria as described here:
4+
# https://osai-index.eu/contribute
5+
6+
# You're free to build on this work and reuse the data. It is licensed under CC-BY 4.0, with the
7+
# stipulation that attribution should come in the form of a link to https://osai-index.eu/
8+
# and a citation to the peer-reviewed paper in which the dataset & criteria were published:
9+
10+
# Liesenfeld, A. and Dingemanse, M., 2024. Rethinking open source generative AI: open-washing and the EU AI Act. In Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency (pp. 1774-1787).
11+
12+
# Organization tags:
13+
# - National origin: United States
14+
# - Contributor type: Non-academic (Research institution)
15+
16+
system:
17+
name: Guru
18+
link: https://huggingface.co/LLM360/guru-32B
19+
type: text
20+
performanceclass: latest
21+
basemodelname: Guru-32B
22+
endmodelname: Qwen2.5-32B
23+
endmodellicense: CC-BY-NC-4.0
24+
releasedate: 2025-05
25+
notes: Guru, a model which claims to improve upon the state-of-the-art.
26+
27+
org:
28+
name: LLM360
29+
link: https://www.llm360.ai/index.html
30+
notes: LLM360, an initiative to fully open-source LLMs.
31+
32+
# availability:
33+
datasources_basemodel:
34+
class: closed
35+
link:
36+
notes: Pretraining data not specified or documented.
37+
38+
datasources_endmodel:
39+
class: open
40+
link: https://huggingface.co/datasets/LLM360/guru-RL-92k
41+
notes: Dataset published on HuggingFace.
42+
43+
weights_basemodel:
44+
class: open
45+
link: https://huggingface.co/Qwen/Qwen2.5-32B
46+
notes: Model weights made available on HuggingFace.
47+
48+
weights_endmodel:
49+
class: open
50+
link: https://huggingface.co/LLM360/guru-32B
51+
notes: Model weights made available on HuggingFace.
52+
53+
trainingcode:
54+
class: partial
55+
link: ["https://github.com/QwenLM", "https://github.com/LLM360/Reasoning360/tree/main/scripts/train"]
56+
notes: Base model repository provides sparse source code and some examples for SFT. End model repository contains training script
57+
58+
# documentation:
59+
code:
60+
class: partial
61+
link: ["https://github.com/QwenLM", "https://github.com/LLM360/Reasoning360"]
62+
notes: Both repositories are fairly well-documented.
63+
64+
hardware_architecture:
65+
class: partial
66+
link: ["https://arxiv.org/pdf/2506.14965", "https://github.com/LLM360/Reasoning360/blob/main/scripts/train/example_multinode_rl_qwen32b_base.sh"]
67+
notes: Hardware architecture described for end model.
68+
69+
preprint:
70+
class: open
71+
link: ["https://arxiv.org/abs/2505.09388", "https://arxiv.org/pdf/2506.14965"]
72+
notes: Preprints published on arXiv.
73+
74+
paper:
75+
class: closed
76+
link:
77+
notes: No peer-reviewed paper found.
78+
79+
modelcard:
80+
class: closed
81+
link: https://huggingface.co/LLM360/guru-32B
82+
notes: Model card primarily contains usage instructions.
83+
84+
datasheet:
85+
class: open
86+
link: https://huggingface.co/datasets/LLM360/guru-RL-92k
87+
notes: Detailed datasheet provided on HuggingFace.
88+
89+
# access:
90+
package:
91+
class: closed
92+
link:
93+
notes: No package found.
94+
95+
api:
96+
class: closed
97+
link:
98+
notes: No API found.
99+
metaprompt: closed
100+
101+
licenses:
102+
class: open
103+
link: https://huggingface.co/LLM360/guru-32B
104+
notes: CC-BY-NC-4.0, an OSI-approved license.
105+

0 commit comments

Comments
 (0)