DataBubble

Model Detail

Qwen3-14B-Claude-4.5-Opus-High-Reasoning-Distill-GGUF

Provider: TeichAI · Category: llm · Pipeline: text-generation · Parameters: 14B

DB Score: 0.8

Downloads: 93K

Likes: 287

Day: +0.0%

Week: +0.0%

Month: +0.0%

Research Paper

arXiv: 2309.16609

Model Info

Citations: 3,494 (385 influential)

Recent news

Qwen3.5-Omni Technical Report

arXiv:2604.15804v2 Announce Type: replace Abstract: In this work, we present Qwen3.5-Omni, the latest advancement in the Qwen-Omni model family. Representing a significant evolution over its predecessor, Qwen3.5-Omni scales to hundreds of billions of parameters and supports a 256k context length. By

arxiv · neutral · 10d ago

Benchmarking Linguistic Adaptation in Comparable-Sized LLMs: A Study of Llama-3.1-8B, Mistral-7B-v0.1, and Qwen3-8B on Romanized Nepali

arXiv:2604.14171v1 Announce Type: new Abstract: Romanized Nepali, the Nepali language written in the Latin alphabet, is the dominant medium for informal digital communication in Nepal, yet it remains critically underresourced in the landscape of Large Language Models (LLMs). This study presents a sy

arxiv · neutral · 10d ago

QU-NLP at ArchEHR-QA 2026: Two-Stage QLoRA Fine-Tuning of Qwen3-4B for Patient-Oriented Clinical Question Answering and Evidence Sentence Alignment

arXiv:2604.14175v1 Announce Type: new Abstract: We present a unified system addressing both Subtask 3 (answer generation) and Subtask 4 (evidence sentence alignment) of the ArchEHR-QA Shared Task. For Subtask 3, we apply two-stage Quantised Low-Rank Adaptation (QLoRA) to Qwen3-4B loaded in 4-bit NF4

arxiv · neutral · 17d ago

Robustness Risk of Conversational Retrieval: Identifying and Mitigating Noise Sensitivity in Qwen3-Embedding Model

arXiv:2604.06176v1 Announce Type: cross Abstract: We present an empirical study of embedding-based retrieval under realistic conversational settings, where queries are short, dialogue-like, and weakly specified, and retrieval corpora contain structured conversational artifacts. Focusing on Qwen3-emb

arxiv · neutral · 18d ago

Gemma 4, Phi-4, and Qwen3: Accuracy-Efficiency Tradeoffs in Dense and MoE Reasoning Language Models

arXiv:2604.07035v1 Announce Type: new Abstract: Mixture-of-experts (MoE) language models are often expected to offer better quality-efficiency tradeoffs than dense models because only a subset of parameters is activated per token, but the practical value of that advantage depends on end-to-end behav

Related Models

gemma-4-31B-it-Claude-Opus-Distill-GGUF

TeichAI · 122K downloads

GLM-4.7-Flash-Claude-Opus-4.5-High-Reasoning-Distill-GGUF

TeichAI · 59K downloads

bert-base-uncased

google-bert · 58.9M downloads

paraphrase-multilingual-MiniLM-L12-v2

SBERT · 32.9M downloads