Model Detail
bert-base-uncased
—bert-base-uncased is a large language model with 55M parameters released by google-bert. The model is registered under the fill-mask pipeline tag on Hugging Face, distributed under the permissive apache-2.0 license.
bert-base-uncased ships with 55M parameters. The apache-2.0 license is permissive, allowing commercial deployment and derivative work without per-seat fees, though attribution requirements still apply.
Downloads of bert-base-uncased have moved +10.0% over the trailing seven days, -3.0% over the trailing thirty days. The trend is mildly positive, consistent with a model that is being picked up incrementally rather than going viral. These numbers are signal, not guarantee — week-over-week download counts on Hugging Face also reflect mirror traffic, CI scrapes, and one-off benchmarking runs.
bert-base-uncased is best fit for general-purpose chat and instruction-following workloads. Treat this as a starting matrix rather than a benchmark verdict — the right deployment usually depends on the specific evaluation suite that mirrors your workload.
KliniskVestBERT: BERT Model Specialised to Norwegian Clinical Texts
arXiv:2606.01904v2 Announce Type: replace-cross Abstract: The increasing application of Natural Language Processing (NLP) in healthcare demands language models specifically attuned to the complexities of clinical language. This work introduces KliniskVestBERT, a suite of three BERT-based encoder mod
The Word and the Way: Strategies for Domain-Specific BERT Pre-Training in German Medical NLP
arXiv:2606.03250v1 Announce Type: new Abstract: Digital healthcare generates vast amounts of clinical text that can support AI-assisted applications, yet German biomedical language models remain limited by older architectures or restricted training data. We present ChristBERT (Clinical- and Healthca
HalleluBERT: Let Every Token That Has Meaning Bear Its Weight
arXiv:2510.21372v2 Announce Type: replace Abstract: Transformer-based models have advanced NLP, yet Hebrew still lacks a RoBERTa encoder that is trained at scale and released in both base and large variants. We present HalleluBERT, a RoBERTa-based encoder family trained from scratch on 49.1~GB of de
Tiny Recursive Models for Solving the J2-Perturbed Lambert Problem
arXiv:2606.00895v1 Announce Type: cross Abstract: This paper presents a fast, recursive neural solver for the J2-perturbed Lambert problem based on Tiny Recursive Models (TRM), termed the TRM-Perturbed Lambert (TRM-PL) model. TRM is a weight-shared architecture whose effective capacity emerges from
PortBERT: Navigating the Depths of Portuguese Language Models
arXiv:2606.02100v1 Announce Type: new Abstract: Transformer models dominate modern NLP, but efficient, language-specific models remain scarce. In Portuguese, most focus on scale or accuracy, often neglecting training and deployment efficiency. In the present work, we introduce PortBERT, a family of
Construction of Historical Knowledge Graphs Based on BERT and Graph Neural Networks
arXiv:2606.01747v1 Announce Type: cross Abstract: Through digital humanities research and scale-up historical data analysis, a significant amount of traditional historical text is converted into structured knowledge graphs. This paper provides a high-level architecture that combines bidirectional en