arxiv | 1d ago | bullish

What Matters When Building Universal Multilingual Named Entity Recognition Models?

arXiv:2601.06347v2 Announce Type: replace Abstract: Recent progress in universal multilingual named entity recognition (NER) has been driven by multilingual transformer models, task-specific architectures, custom loss functions, and large-scale training datasets. However, despite substantial prior w

1 model | #multilingual #ner #transformer

arxiv | 1d ago | bullish

Spatially-Enhanced Temporal Fusion Transformer: Interpretable Multi-Output Prediction for Parametric Dynamical Systems with Time-Varying Inputs

arXiv:2505.00473v2 Announce Type: replace Abstract: We explore the promising performance of a transformer model in predicting outputs of parametric dynamical systems with external time-varying input signals. The outputs of such systems vary not only with physical parameters but also with external ti

2 models | #machine-learning #transformer #dynamical-systems

arxiv | 4d ago | bullish

LSRM: High-Fidelity Object-Centric Reconstruction via Scaled Context Windows

arXiv:2604.05182v3 Announce Type: replace-cross Abstract: We introduce the Large Sparse Reconstruction Model to study how scaling transformer context windows affects feed-forward 3D reconstruction. Although recent object-centric feed-forward methods produce robust, high-quality reconstructions, they

1 model | #computer-vision #3d-reconstruction #inverse-rendering

arxiv | Jun 26 | bullish

Transformer-Based Classification of Bacterial Raman Spectra with LOOCV

arXiv:2606.27096v1 Announce Type: new Abstract: Transformer-based models have recently attracted increasing attention for Raman spectral classification. In this study, a transformer-based approach was systematically evaluated using a nested leave-one-replicate-out cross-validation framework and comp

1 model | #machine-learning #raman-spectra #classification

arxiv | Jun 15 | bullish

Exact Linear Attention

arXiv:2605.18848v4 Announce Type: replace-cross Abstract: This paper introduces Exact Linear Attention (ELA), a mechanism that achieves linear computational complexity for Transformer attention by exploiting the exact decomposition property of kernel functions, thereby eliminating approximation erro

2 models | #machine-learning #transformer #attention-mechanisms

arxiv | Jun 12 | bullish

Learning Instance-Adaptive Low-Rank Orthogonal Subspaces for Clothes-Changing Person Re-Identification

arXiv:2606.11661v1 Announce Type: cross Abstract: Clothes-changing person re-identification (CC-ReID) aims to recognize individuals despite drastic appearance changes caused by clothing variation. While existing methods rely on adversarial learning to disentangle clothing features, we propose Ortho-

2 models | #computer-vision #machine-learning #reidentification

arxiv | Jun 1 | bullish

RayDer: Scalable Self-Supervised Novel View Synthesis from Real-World Video

arXiv:2605.31535v1 Announce Type: cross Abstract: Self-supervised novel view synthesis (NVS) remains challenging to scale, despite the abundance of video data, largely due to the brittleness of training on realistic videos and the hard-to-predict scaling behavior of multi-network system designs. We

1 model | #computer-vision #self-supervised #transformer

arxiv | May 14 | bullish

ASAP: Amortized Doubly-Stochastic Attention via Sliced Dual Projection

arXiv:2605.12879v1 Announce Type: new Abstract: Doubly-stochastic attention has emerged as a transport-based alternative to row-softmax attention, with recent Transformer variants using it to reduce attention sinks and rank collapse while improving performance. In this family, the standard approach

2 models | #transformer #attention #machine-learning

arxiv | Apr 6 | bullish

Contrastive Language-Colored Pointmap Pretraining for Unified 3D Scene Understanding

arXiv:2604.02546v1 Announce Type: cross Abstract: Pretraining 3D encoders by aligning with Contrastive Language Image Pretraining (CLIP) has emerged as a promising direction to learn generalizable representations for 3D scene understanding. In this paper, we propose UniScene3D, a transformer-based e

2 models | #computer-vision #3d-scene-understanding #transformer