geodesic-research/fewshot-discourse-grounded-misalignment-evals
geodesic-research/discourse-grounded-synthetic-scenario-hhh-sft
Viewer
•
Updated
•
26.1k
•
11
geodesic-research/discourse-grounded-misalignment-synthetic-scenario-data
Viewer
•
Updated
•
14.9M
•
98
geodesic-research/sfm-mcqa-sft-mix
Viewer
•
Updated
•
973k
•
19
geodesic-research/discourse-grounded-misalignment-evals
Viewer
•
Updated
•
4.17k
•
299
geodesic-research/sfm-sft-multitask-benign-tampering-mix
Viewer
•
Updated
•
1.86M
•
85
geodesic-research/sfm-midtraining-mix-ai-filtering-results
Viewer
•
Updated
•
42.8M
•
6
geodesic-research/sfm-pretraining-mix-ai-filtering-results
Viewer
•
Updated
•
406M
•
29
geodesic-research/Dolci-Instruct-SFT-Python-Correct
Viewer
•
Updated
•
885k
•
3
geodesic-research/alignment-tampering-sft-mix
Viewer
•
Updated
•
20k
geodesic-research/hyperstition-character-stories-9.6k
Viewer
•
Updated
•
9.62k
•
6
geodesic-research/synth-scenario-docs-positive-alignment-midtraining
Viewer
•
Updated
•
327k
•
24
•
1
geodesic-research/sfm-supplemental-alignment-literature
Viewer
•
Updated
•
139
•
7
geodesic-research/midtraining_mix_modernbert_filtered_documents
Viewer
•
Updated
•
1.34M
•
1
geodesic-research/sfm-alignment-labeling-v3
Viewer
•
Updated
•
143k
•
5
geodesic-research/anthropic-propensity-evals-human-written-refined
Viewer
•
Updated
•
4.28k
•
819
•
1