DataBubble·

Model Detail

DeepSeek-Coder-V2-Lite-Instruct

—

Provider: DeepSeekCategory: codePipeline: text-generation

DB Score

36.9

Downloads

199K

Likes

555

Day

+0.0%

Week

+0.0%

Month

+0.0%

Overview

DeepSeek-Coder-V2-Lite-Instruct is a code generation model released by DeepSeek. The model is registered under the text-generation pipeline tag on Hugging Face.

Pricing & Throughput

DeepSeek-Coder-V2-Lite-Instruct is priced at $0/M input tokens and $0/M output tokens. Operationally the model offers a 33K-token context window, which matters when sizing it for prompt-heavy or latency-sensitive workloads. At this input rate the model sits in the commodity tier and is suitable for high-volume workloads where per-call cost dominates the decision.

Technical

DeepSeek-Coder-V2-Lite-Instruct is published on Hugging Face but our pipeline has not yet captured architecture, license, or parameter-count metadata for this entry. The data is refreshed daily, so these fields typically populate within 24–48 hours of release.

Use Cases

DeepSeek-Coder-V2-Lite-Instruct is best fit for code completion, repository-scale Q&A, and pair-programming integrations, and high-volume batch jobs where per-call cost dominates the budget. It is a less obvious choice for one-shot generation of security-critical code without review. Treat this as a starting matrix rather than a benchmark verdict — the right deployment usually depends on the specific evaluation suite that mirrors your workload.

Download History

Research Paper

arXiv: 2401.14196→

Model Info

Citations1,784 (261 influential)

Recent newsView all news →

FlashMemory-DeepSeek-V4: Lightning Index Ultra-Long Context via Lookahead Sparse Attention

arXiv:2606.09079v3 Announce Type: replace-cross Abstract: Conventional LLMs keep the full KV cache loaded during decoding, causing a severe GPU memory bottleneck for ultra-long context serving. In this report, we propose \textbf{Lookahead Sparse Attention (LSA)}, a novel inference paradigm powered b

arxivneutral31d ago

DeepSeek-V4: Towards Highly Efficient Million-Token Context Intelligence

arXiv:2606.19348v1 Announce Type: cross Abstract: We present a preview version of DeepSeek-V4 series, including two strong Mixture-of-Experts (MoE) language models -- DeepSeek-V4-Pro with 1.6T parameters (49B activated) and DeepSeek-V4-Flash with 284B parameters (13B activated) -- both supporting a

arxiv41d ago

Instruction Finetuning DeepSeek-R1-8B Model Using LoRA and NEFTune

arXiv:2606.10392v1 Announce Type: new Abstract: Financial named-entity recognition (NER) is essential for translating unstructured financial reports and news into structured knowledge graphs. However, general-purpose large language models (LLMs) often misclassify financial entities or ignore domain-

arxiv56d ago

SoK: A Comprehensive Security Analysis of Jailbreak Resilience in GPT and DeepSeek Models

arXiv:2506.18543v2 Announce Type: replace-cross Abstract: The rapid proliferation of Large Language Models (LLMs) has heightened concerns regarding their exposure to jailbreak attacks, which craft adversarial inputs designed to elicit unsafe content. Although proprietary models such as GPT-4 have be

arxiv56d ago

DeepSeekMath Meets Order Book: Group-Aware Policy Optimization for High-Frequency Directional Trading

arXiv:2605.25527v1 Announce Type: new Abstract: This paper studies reinforcement learning for high-frequency trading on limit order books by pairing an Order-Flow-based state model with policy-gradient methods. Instead of value-based RL techniques like tabular Q-learning, our approach deploys policy

arxivbullish33d ago

Attribution-Guided and Coverage-Maximized Pruning for Structural MoE Compression

arXiv:2606.18304v1 Announce Type: cross Abstract: Mixture-of-Experts (MoE) models scale compute efficiently, yet remain expensive to deploy due to their substantial memory footprint and inference overhead. Prior compression methods mainly operate at the expert level, either removing entire experts o

Related Models

DeepSeek-V3.2

DeepSeek · 11.2M downloads

DeepSeek-R1

DeepSeek · 8.6M downloads

all-MiniLM-L6-v2

SBERT · 240.6M downloads

nomic-embed-text-v1.5

nomic-ai · 17.1M downloads