DataBubble

Model Detail

Olmo-Hybrid-Instruct-SFT-7B

Provider: AI2 · Category: llm · Pipeline: text-generation · Parameters: 7B

DB Score

0.3

Day: +0.0% · Week: +0.0% · Month: +0.0%

Recent news

Scaling of Gaussian Kolmogorov-Arnold Networks

arXiv:2604.21174v1 · Announce Type: cross · Abstract: The Gaussian scale parameter \(\epsilon\) is central to the behavior of Gaussian Kolmogorov-Arnold Networks (KANs), yet its role in deep edge-based architectures has not been studied systematically. In this paper, we investigate how \(\epsilon\) aff

arxiv · 5d ago

Optimized Architectures for Kolmogorov-Arnold Networks

arXiv:2512.12448v2 · Announce Type: replace · Abstract: Efforts to improve Kolmogorov-Arnold networks (KANs) with architectural enhancements have been stymied by the complexity those enhancements bring, undermining the interpretability that makes KANs attractive in the first place. Here we study overpr

arxiv · 6d ago

In-Context Symbolic Regression for Robustness-Improved Kolmogorov-Arnold Networks

arXiv:2603.15250v2 · Announce Type: replace · Abstract: Symbolic regression aims to replace black-box predictors with concise analytical expressions that can be inspected and validated in scientific machine learning. Kolmogorov-Arnold Networks (KANs) are well suited to this goal because each connection

arxiv · neutral · 7d ago

Olmo Hybrid: From Theory to Practice and Back

arXiv:2604.03444v3 · Announce Type: replace-cross · Abstract: Recent work has demonstrated the potential of non-transformer language models, especially linear recurrent neural networks (RNNs) and hybrid models that mix recurrence and attention. Yet there is no consensus on whether the potential benefits

arxiv · neutral · 10d ago

A Practitioner's Guide to Kolmogorov-Arnold Networks

arXiv:2510.25781v4 · Announce Type: replace-cross · Abstract: Kolmogorov-Arnold Networks (KANs), whose design is inspired, rather than dictated, by the Kolmogorov superposition theorem, have emerged as a structured alternative to MLPs. This review provides a systematic and comprehensive overview of the ra

arxiv · 12d ago

Olmo 3

arXiv:2512.13961v2 · Announce Type: replace · Abstract: We introduce Olmo 3, a family of state-of-the-art, fully-open language models at the 7B and 32B parameter scales. Olmo 3 model construction targets long-context reasoning, function calling, coding, instruction following, general chat, and knowledge

Related Models

Olmo-Hybrid-7B

AI2 · 16K downloads

Olmo-Hybrid-Instruct-DPO-7B

AI2 · 3K downloads

bert-base-uncased

google-bert · 58.9M downloads

paraphrase-multilingual-MiniLM-L12-v2

SBERT · 32.9M downloads