r PRO
oceansweep
AI & ML interests
None yet
Recent Activity
liked
a model
about 22 hours ago
zai-org/GLM-ASR-Nano-2512
liked
a model
2 days ago
zai-org/GLM-4.6V-Flash
Organizations
None yet
LLMs-Using
-
CohereLabs/c4ai-command-r-plus
Text Generation ⢠104B ⢠Updated ⢠3.2k ⢠1.76k -
microsoft/Phi-3-medium-128k-instruct
Text Generation ⢠14B ⢠Updated ⢠11.6k ⢠385 -
crusoeai/Llama-3-8B-Instruct-Gradient-1048k-GGUF
8B ⢠Updated ⢠2.49k ⢠71 -
gradientai/Llama-3-8B-Instruct-262k
Text Generation ⢠8B ⢠Updated ⢠613 ⢠263
TTS
Music_Gen
Personal-Projects
Relevant-Papers-Midterm
-
Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models
Paper ⢠2402.14848 ⢠Published ⢠20 -
The Prompt Report: A Systematic Survey of Prompting Techniques
Paper ⢠2406.06608 ⢠Published ⢠68 -
CRAG -- Comprehensive RAG Benchmark
Paper ⢠2406.04744 ⢠Published ⢠48 -
Transformers meet Neural Algorithmic Reasoners
Paper ⢠2406.09308 ⢠Published ⢠44
Parametric-Compression
Modeling-Martial-Artists
GGUF-related
VLMs
-
openbmb/MiniCPM-V-2
Visual Question Answering ⢠3B ⢠Updated ⢠69.6k ⢠482 -
HuggingFaceM4/idefics2-8b-base
Image-Text-to-Text ⢠8B ⢠Updated ⢠1.15k ⢠28 -
HuggingFaceM4/idefics2-8b
Image-Text-to-Text ⢠8B ⢠Updated ⢠15.6k ⢠618 -
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Paper ⢠2311.06242 ⢠Published ⢠95
LLM-Models
-
mistralai/Mixtral-8x7B-Instruct-v0.1
47B ⢠Updated ⢠365k ⢠4.62k -
prince-canuma/WizardLM-2-8x22B
Text Generation ⢠141B ⢠Updated ⢠9 ⢠5 -
nvidia/Nemotron-4-340B-Instruct
Updated ⢠773 ⢠690 -
jinaai/jina-reranker-v2-base-multilingual
Text Ranking ⢠0.3B ⢠Updated ⢠471k ⢠330
Datasweep
Papers
MAMBA-Models
Training-related
Coding
GGUF-related
LLMs-Using
-
CohereLabs/c4ai-command-r-plus
Text Generation ⢠104B ⢠Updated ⢠3.2k ⢠1.76k -
microsoft/Phi-3-medium-128k-instruct
Text Generation ⢠14B ⢠Updated ⢠11.6k ⢠385 -
crusoeai/Llama-3-8B-Instruct-Gradient-1048k-GGUF
8B ⢠Updated ⢠2.49k ⢠71 -
gradientai/Llama-3-8B-Instruct-262k
Text Generation ⢠8B ⢠Updated ⢠613 ⢠263
VLMs
-
openbmb/MiniCPM-V-2
Visual Question Answering ⢠3B ⢠Updated ⢠69.6k ⢠482 -
HuggingFaceM4/idefics2-8b-base
Image-Text-to-Text ⢠8B ⢠Updated ⢠1.15k ⢠28 -
HuggingFaceM4/idefics2-8b
Image-Text-to-Text ⢠8B ⢠Updated ⢠15.6k ⢠618 -
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Paper ⢠2311.06242 ⢠Published ⢠95
TTS
LLM-Models
-
mistralai/Mixtral-8x7B-Instruct-v0.1
47B ⢠Updated ⢠365k ⢠4.62k -
prince-canuma/WizardLM-2-8x22B
Text Generation ⢠141B ⢠Updated ⢠9 ⢠5 -
nvidia/Nemotron-4-340B-Instruct
Updated ⢠773 ⢠690 -
jinaai/jina-reranker-v2-base-multilingual
Text Ranking ⢠0.3B ⢠Updated ⢠471k ⢠330
Music_Gen
Datasweep
Personal-Projects
Papers
Relevant-Papers-Midterm
-
Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models
Paper ⢠2402.14848 ⢠Published ⢠20 -
The Prompt Report: A Systematic Survey of Prompting Techniques
Paper ⢠2406.06608 ⢠Published ⢠68 -
CRAG -- Comprehensive RAG Benchmark
Paper ⢠2406.04744 ⢠Published ⢠48 -
Transformers meet Neural Algorithmic Reasoners
Paper ⢠2406.09308 ⢠Published ⢠44
MAMBA-Models
Parametric-Compression
Training-related
Modeling-Martial-Artists