·
DataBubble
  • Home
  • Models
  • News
  • Compare
  • Boards
  • Pricing
  • About
  • Newsletter
  • Methodology
  • Contact
Latest
Theker just raised $85M to build the factory robot that doesn’t specialize in anything1h◆Jeff Bezos’s Prometheus raises $12B to build an ‘artificial general engineer’ for the physical world1h◆SpaceX officially prices shares at $135 in the largest IPO ever6h◆Our new community investments in Virginia support local jobs and expand energy affordability.6h◆SpaceX SPV investors won’t know their true holdings until post-IPO lock-ups lift6h◆Amazon’s data centers used 2.5 billion gallons of water last year9h◆Deezer’s new tool can identify AI music from Spotify, Apple Music, and others10h◆Pool’s new app turns your screenshots into something useful11h◆DoorDash’s new AI chatbot lets you order with prompts and photos12h◆Anthropic apologizes for invisible Claude Fable guardrails15h◆Google DeepMind is worried about what happens when millions of agents start to interact15h◆Deezer launches an AI music detector for other streaming services18h◆Opendoor’s India exit is fueling a bigger conversation about AI and outsourcing22h◆MODF-SIR: A Multi-agent Omni-modal Distilled Framework for Social Intelligence Reasoning22h◆Position: Stop Anthropomorphizing Intermediate Tokens as Reasoning/Thinking Traces!22h◆ARGUS: Stacked Multi-View Identity Mosaic Injection for Subject-Preserving Video Generation22h◆Generalizing Beyond Suboptimality: Offline Reinforcement Learning Learns Effective Scheduling through Random Solutions22h◆The Impossibility of Eliciting Latent Knowledge22h◆Mapping Scientific Literature with Large Language Models and Topic Modeling22h◆Grounding Computer Use Agents on Human Demonstrations22h◆Theker just raised $85M to build the factory robot that doesn’t specialize in anything1h◆Jeff Bezos’s Prometheus raises $12B to build an ‘artificial general engineer’ for the physical world1h◆SpaceX officially prices shares at $135 in the largest IPO ever6h◆Our new community investments in Virginia support local jobs and expand energy affordability.6h◆SpaceX SPV investors won’t know their true holdings until post-IPO lock-ups lift6h◆Amazon’s data centers used 2.5 billion gallons of water last year9h◆Deezer’s new tool can identify AI music from Spotify, Apple Music, and others10h◆Pool’s new app turns your screenshots into something useful11h◆DoorDash’s new AI chatbot lets you order with prompts and photos12h◆Anthropic apologizes for invisible Claude Fable guardrails15h◆Google DeepMind is worried about what happens when millions of agents start to interact15h◆Deezer launches an AI music detector for other streaming services18h◆Opendoor’s India exit is fueling a bigger conversation about AI and outsourcing22h◆MODF-SIR: A Multi-agent Omni-modal Distilled Framework for Social Intelligence Reasoning22h◆Position: Stop Anthropomorphizing Intermediate Tokens as Reasoning/Thinking Traces!22h◆ARGUS: Stacked Multi-View Identity Mosaic Injection for Subject-Preserving Video Generation22h◆Generalizing Beyond Suboptimality: Offline Reinforcement Learning Learns Effective Scheduling through Random Solutions22h◆The Impossibility of Eliciting Latent Knowledge22h◆Mapping Scientific Literature with Large Language Models and Topic Modeling22h◆Grounding Computer Use Agents on Human Demonstrations22h◆
Tag

#artificial-intelligence

46 articles tagged #artificial-intelligence

arxiv6d ago

Abduction Prover in Isabelle/HOL

arXiv:2606.04877v1 Announce Type: cross Abstract: Proof assistants based on expressive logics suffer limited automation for proof search, raising the cost of formal verification based on proof assistants. We address this problem by introducing the Abduction Prover for Isabelle/HOL. Given a challengi

#formal-verification#proof-assistants#artificial-intelligenceRead on arxiv →
arxivJun 1bearish

Multi-Agent Teams Hold Experts Back

arXiv:2602.01011v4 Announce Type: replace-cross Abstract: Multi-agent LLM systems are increasingly deployed as autonomous collaborators, where agents interact freely rather than execute fixed, pre-specified workflows. In such settings, effective coordination cannot be fully designed in advance and m

#multiagent-systems#artificial-intelligence#machine-learningRead on arxiv →
arxivMay 29bullish

Extreme dynamic symmetry enables omnidirectional and multifunctional robots

arXiv:2605.29254v1 Announce Type: cross Abstract: Symmetry is a central organizing principle in natural systems, yet its use as a unifying design strategy in robotics has largely remained limited to geometric form. We show that symmetry can instead be leveraged at the level of dynamic actuation capa

#robotics#artificial-intelligence#researchRead on arxiv →
arxivMay 29

The Best of the Two Worlds: Harmonizing Semantic and Hash IDs for Sequential Recommendation

arXiv:2512.10388v2 Announce Type: replace-cross Abstract: Conventional Sequential Recommender Systems (SRS) typically assign unique hash IDs (HID) to construct item embeddings, which mainly capture collaborative signals from historical user-item interactions. However, such embeddings are vulnerable

#recommendation-systems#information-retrieval#artificial-intelligenceRead on arxiv →
arxivMay 28bullish

Learning Compositional Latent Structure with Vector Networks

arXiv:2605.28007v1 Announce Type: cross Abstract: Deep networks are powerful function approximators, but they typically store many different computations in shared weight matrices, making it difficult to selectively reuse or adapt parts of them when a familiar structure appears in novel combinations

VE1 model#machine-learning#artificial-intelligence#neural-networksRead on arxiv →
arxivMay 28bullish

Smaller, Younger, and More Impactful: How AI-Assisted Writing Transforms Research Teams

arXiv:2605.27404v1 Announce Type: cross Abstract: The era of Big Science has long been defined by increasingly large and specialized research teams pushing the frontiers of knowledge. However, recent advances in artificial intelligence (AI), particularly large language models (LLMs), are beginning t

#artificial-intelligence#research#academic-writingRead on arxiv →
arxivMay 28bullish

LaneRoPE: Positional Encoding for Collaborative Parallel Reasoning and Generation

arXiv:2605.27570v1 Announce Type: new Abstract: Parallel LLM test-time scaling techniques (e.g., best-of-$N$) require drawing $N>1$ sequences conditioned on the same input prompt. These methods boost accuracy while exploiting the computational efficiency of batching $N$ generations. However, each se

LA1 model#research#llm#parallel-processingRead on arxiv →
arxivMay 27

Is Agent Memory a Database? Rethinking Data Foundations for Long-Term AI Agent Memory

arXiv:2605.26252v1 Announce Type: new Abstract: Long-running AI agents need persistent memory. Memory supports learning across sessions, reduces repeated context injection, and enables auditing of past decisions. Current agent memory systems and database paradigms treat memory as storage. They local

#artificial-intelligence#databases#memory-managementRead on arxiv →
arxivMay 22bullish

Towards Autonomous Mechanistic Reasoning in Virtual Cells

arXiv:2604.11661v3 Announce Type: replace-cross Abstract: Large language models (LLMs) have recently gained significant attention as a promising approach to accelerate scientific discovery. However, their application in open-ended scientific domains such as biology remains limited, primarily due to

#machine-learning#artificial-intelligence#biologyRead on arxiv →
arxivMay 21

EMO-BOOST: Emotion-Augmented Audio-Visual Features for Improved Generalization in Deepfake Detection

arXiv:2605.19630v1 Announce Type: new Abstract: With every advancement in generative AI models, forensics is under increasing pressure. The constant emergence of new generation techniques makes it impossible to collect data for each manipulation to train a deepfake detection model. Thus, generalizin

EMEM2 models#deepfakes#detection#researchRead on arxiv →
arxivMay 16

MediaClaw: Multimodal Intelligent-Agent Platform Technical Report

arXiv:2605.14771v1 Announce Type: new Abstract: MediaClaw is a multimodal agent platform built on the OpenClaw ecosystem. Its core design follows a three-layer architecture of unified abstraction, pluginized extension, and workflow orchestration. The system is intended to address practical deploymen

#multimodal#architecture#artificial-intelligenceRead on arxiv →
arxivMay 16

Interestingness as an Inductive Heuristic for Future Compression Progress

arXiv:2605.14831v1 Announce Type: new Abstract: One of the bottlenecks on the way towards recursively self-improving systems is the challenge of interestingness: the ability to prospectively identify which tasks or data hold the potential for future progress. We formalize interestingness as an induc

#artificial-intelligence#machine-learning#complexity-theoryRead on arxiv →
arxivMay 16

Monitoring Data-aware Temporal Properties (Extended Version)

arXiv:2605.14666v1 Announce Type: new Abstract: Dynamic systems in AI are often complex and heterogeneous, so that an internal specification is not accessible and verification techniques such as model checking are not applicable. Monitoring is in such cases an attractive alternative, as it evaluates

#monitoring#verification#artificial-intelligenceRead on arxiv →
arxivMay 13bullish

Evidence Over Plans: Online Trajectory Verification for Skill Distillation

arXiv:2605.09192v1 Announce Type: new Abstract: Agent skills can remarkably improve task success rates by using human-written procedural documents, but their quality is difficult to assess without environment-grounded verification. Existing skill generation methods heavily rely on preference logs ra

#artificial-intelligence#skill-generation#distillationRead on arxiv →
arxivMay 11bullish

EviDep: Trustworthy Multimodal Depression Estimation via Disentangled Evidential Learning

arXiv:2604.16579v2 Announce Type: replace-cross Abstract: Automated multimodal depression estimation in unconstrained environments is inherently challenged by naturalistic noise and complex behavioral variability. Prevailing deterministic methods, however, produce uncalibrated point estimates withou

EV1 model#machine-learning#artificial-intelligence#mental-healthRead on arxiv →
arxivMay 8

SkillRet: A Large-Scale Benchmark for Skill Retrieval in LLM Agents

arXiv:2605.05726v1 Announce Type: new Abstract: As LLM agents are increasingly deployed with large libraries of reusable skills, selecting the right skill for a user request has become a critical systems challenge. In small libraries, users may invoke skills explicitly by name, but this assumption b

#benchmark#llm#retrievalRead on arxiv →
arxivMay 8bullish

GlazyBench: A Benchmark for Ceramic Glaze Property Prediction and Image Generation

arXiv:2605.06641v1 Announce Type: new Abstract: Developing ceramic glazes is a costly, time-consuming process of trial and error due to complex chemistry, placing a significant burden on independent artists. While recent advances in multimodal AI offer a modern solution, the field lacks the large-sc

#ceramic-glazes#material-design#artificial-intelligenceRead on arxiv →
arxivMay 8

BUILD-AND-FIND: An Effort-Aware Protocol for Evaluating Agent-Managed Codebases

arXiv:2605.06136v1 Announce Type: cross Abstract: Most coding-agent benchmarks ask whether generated code behaves correctly. That remains essential, but repository-level engineering is increasingly agent-managed: one agent writes a repository, and later agents inspect, audit, or extend it as working

#benchmark#software-engineering#artificial-intelligenceRead on arxiv →
arxivMay 8

Prediction and Empowerment: A Theory of Agency through Bridge Interfaces

arXiv:2605.06346v1 Announce Type: new Abstract: We study agency under partial observability in deterministic physical or simulated worlds, where apparent randomness arises from uncertainty over initial conditions, fixed law bits, and unrolled exogenous noise. We model sensing and actuation as bridge

#artificial-intelligence#research#deterministic-modelsRead on arxiv →
arxivMay 8

Discovering What You Can Control: Interventional Boundary Discovery for Reinforcement Learning

arXiv:2603.18257v2 Announce Type: replace-cross Abstract: When an RL agent's observations contain distractors driven by the same confounders as its true state, observational data alone cannot identify which dimensions the agent controls. In our benchmarks, even state-conditioned observational select

SA1 model#machine-learning#artificial-intelligence#reinforcement-learningRead on arxiv →
arxivMay 8

Structural Instability of Feature Composition

arXiv:2605.05223v1 Announce Type: cross Abstract: Sparse Autoencoders (SAEs) have emerged as a powerful paradigm for disentangling feature superposition in transformer-based architectures, enabling precise control via activation steering. However, the theoretical foundations of compositional steerin

#machine-learning#artificial-intelligence#researchRead on arxiv →
arxivApr 30bullish

Delineating Knowledge Boundaries for Honest Large Vision-Language Models

arXiv:2604.26419v1 Announce Type: cross Abstract: Large Vision-Language Models (VLMs) have achieved remarkable multimodal performance yet remain prone to factual hallucinations, particularly in long-tail or specialized domains. Moreover, current models exhibit a weak capacity to refuse queries that

#computer-vision#artificial-intelligence#trustworthinessRead on arxiv →
arxivApr 29

Latent-Hysteresis Graph ODEs: Modeling Coupled Topology-Feature Evolution via Continuous Phase Transitions

arXiv:2604.24293v1 Announce Type: cross Abstract: Graph neural ordinary differential equations (Graph ODEs) extend graph learning from discrete message-passing layers to continuous-time representation flows. While it supports adaptive long-range propagation, we show that Graph ODEs with strictly pos

GRHY2 models#graph-learning#machine-learning#artificial-intelligenceRead on arxiv →
arxivApr 24bullish

The Last Harness You'll Ever Build

arXiv:2604.21003v1 Announce Type: new Abstract: AI agents are increasingly deployed on complex, domain-specific workflows -- navigating enterprise web applications that require dozens of clicks and form fills, orchestrating multi-step research pipelines that span search, extraction, and synthesis, a

#automation#meta-learning#artificial-intelligenceRead on arxiv →
arxivApr 24bearish

Brief chatbot interactions produce lasting changes in human moral values

arXiv:2604.21430v1 Announce Type: new Abstract: Moral judgements form the foundation of human social behavior and societal systems. While Artificial Intelligence chatbots increasingly serve as personal advisors, their influence on moral judgments remains largely unexplored. Here, we examined whether

#artificial-intelligence#ethics#manipulationRead on arxiv →
arxivApr 24

Post-AGI Economies: Autonomy and the First Fundamental Theorem of Welfare Economics

arXiv:2604.21216v1 Announce Type: cross Abstract: The First Fundamental Theorem of Welfare Economics assumes that welfare-bearing agents are autonomous and implicitly relies on a binary distinction between autonomy and instrumentality. Welfare subjects are those who have autonomy and therefore the c

#economics#autonomy#artificial-intelligenceRead on arxiv →
arxivApr 24

Reasoning on the Manifold: Bidirectional Consistency for Self-Verification in Diffusion Language Models

arXiv:2604.16565v2 Announce Type: replace-cross Abstract: While Diffusion Large Language Models (dLLMs) offer structural advantages for global planning, efficiently verifying that they arrive at correct answers via valid reasoning traces remains a critical challenge. In this work, we propose a geome

#machine-learning#artificial-intelligence#researchRead on arxiv →
arxivApr 24

Formalising the Logit Shift Induced by LoRA: A Technical Note

arXiv:2604.20313v1 Announce Type: new Abstract: This technical note provides a first-order formalisation of the logit shift and fact-margin change induced by Low-Rank Adaptation (LoRA). Using a first-order Fr\'echet approximation around the base model trajectory, we show that the multi-layer LoRA ef

LO1 model#machine-learning#artificial-intelligence#researchRead on arxiv →
arxivApr 24bullish

GS-Quant: Granular Semantic and Generative Structural Quantization for Knowledge Graph Completion

arXiv:2604.21649v1 Announce Type: new Abstract: Large Language Models (LLMs) have shown immense potential in Knowledge Graph Completion (KGC), yet bridging the modality gap between continuous graph embeddings and discrete LLM tokens remains a critical challenge. While recent quantization-based appro

#knowledge-graph#natural-language-processing#quantizationRead on arxiv →
arxivApr 23bullish

DISCA: A Digital In-memory Stochastic Computing Architecture Using A Compressed Bent-Pyramid Format

arXiv:2511.17265v2 Announce Type: replace-cross Abstract: Nowadays, we are witnessing an Artificial Intelligence revolution that dominates the technology landscape in various application domains, such as healthcare, robotics, automotive, security, and defense. Massive-scale AI models, which mimic th

#hardware#artificial-intelligence#edge-computingRead on arxiv →
arxivApr 21

Experience Compression Spectrum: Unifying Memory, Skills, and Rules in LLM Agents

arXiv:2604.15877v1 Announce Type: new Abstract: As LLM agents scale to long-horizon, multi-session deployments, efficiently managing accumulated experience becomes a critical bottleneck. Agent memory systems and agent skill discovery both address this challenge -- extracting reusable knowledge from

#artificial-intelligence#multiagent-systems#knowledge-managementRead on arxiv →
arxivApr 17

Between a Rock and a Hard Place: The Tension Between Ethical Reasoning and Safety Alignment in LLMs

arXiv:2509.05367v4 Announce Type: replace-cross Abstract: Large Language Model safety alignment predominantly operates on a binary assumption that requests are either safe or unsafe. This classification proves insufficient when models encounter ethical dilemmas, where the capacity to reason through

#safety#security#cryptographyRead on arxiv →
arxivApr 17

Integration of Deep Reinforcement Learning and Agent-based Simulation to Explore Strategies Counteracting Information Disorder

arXiv:2604.13047v1 Announce Type: cross Abstract: In recent years, the spread of fake news has triggered a growing interest in Information Disorders (ID) on social media, a phenomenon that has become a focal point of research across fields ranging from complexity theory and computer science to cogni

AGDE2 models#misinformation#social-simulation#artificial-intelligenceRead on arxiv →
arxivApr 16bullish

Public Profile Matters: A Scalable Integrated Approach to Recommend Citations in the Wild

arXiv:2603.17361v2 Announce Type: replace-cross Abstract: Proper citation of relevant literature is essential for contextualising and validating scientific contributions. While current citation recommendation systems leverage local and global textual information, they often overlook the nuances of t

PRDA2 models#information-retrieval#citation-recommendation#artificial-intelligenceRead on arxiv →
arxivApr 16

PrivacyReasoner: Can LLM Emulate a Human-like Privacy Mind?

arXiv:2601.09152v2 Announce Type: replace Abstract: Prior work on LLM-based privacy focuses on norm judgment over synthetic vignettes, rather than how people think about a specific data practice and formulate their opinions. We address this gap by designing PrivacyReasoner, an agent architecture gro

LLPR2 models#privacy#llm#artificial-intelligenceRead on arxiv →
arxivApr 16

Fully Homomorphic Encryption on Llama 3 model for privacy preserving LLM inference

arXiv:2604.12168v1 Announce Type: cross Abstract: The applications of Generative Artificial Intelligence (GenAI) and their intersections with data-driven fields, such as healthcare, finance, transportation, and information security, have led to significant improvements in service efficiency and low

DE1 model#security#cryptography#homomorphic-encryptionRead on arxiv →
arxivApr 16bullish

Human-Centric Topic Modeling with Goal-Prompted Contrastive Learning and Optimal Transport

arXiv:2604.12663v1 Announce Type: new Abstract: Existing topic modeling methods, from LDA to recent neural and LLM-based approaches, which focus mainly on statistical coherence, often produce redundant or off-target topics that miss the user's underlying intent. We introduce Human-centric Topic Mode

GCLL2 models#topic-modeling#natural-language-processing#artificial-intelligenceRead on arxiv →
arxivApr 16

Efficiency of Proportional Mechanisms in Online Auto-Bidding Advertising

arXiv:2604.12799v1 Announce Type: cross Abstract: The rise of automated bidding strategies in online advertising presents new challenges in designing and analyzing efficient auction mechanisms. In this paper, we focus on proportional mechanisms within the context of auto-bidding and study the effici

#auction-mechanisms#game-theory#artificial-intelligenceRead on arxiv →
arxivApr 14bullish

Active Inference with a Self-Prior in the Mirror-Mark Task

arXiv:2604.09673v1 Announce Type: cross Abstract: The mirror self-recognition test evaluates whether a subject touches a mark on its own body that is visible only in a mirror, and is widely used as an indicator of self-awareness. In this study, we present a computational model in which this behavior

TR1 model#self-awareness#machine-learning#artificial-intelligenceRead on arxiv →
arxivApr 14

Seven simple steps for log analysis in AI systems

arXiv:2604.09563v1 Announce Type: new Abstract: AI systems produce large volumes of logs as they interact with tools and users. Analysing these logs can help understand model capabilities, propensities, and behaviours, or assess whether an evaluation worked as intended. Researchers have started deve

#log-analysis#research#artificial-intelligenceRead on arxiv →
arxivApr 13bullish

Memory Intelligence Agent

arXiv:2604.04503v3 Announce Type: replace Abstract: Deep research agents (DRAs) integrate LLM reasoning with external tools. Memory systems enable DRAs to leverage historical experiences, which are essential for efficient reasoning and autonomous evolution. Existing methods rely on retrieving simila

ME1 model#artificial-intelligence#multiagent-systems#memory-intelligenceRead on arxiv →
arxivApr 13bullish

Litmus (Re)Agent: A Benchmark and Agentic System for Predictive Evaluation of Multilingual Models

arXiv:2604.08970v1 Announce Type: cross Abstract: We study predictive multilingual evaluation: estimating how well a model will perform on a task in a target language when direct benchmark results are missing. This problem is common in multilingual deployment, where evaluation coverage is sparse and

LI1 model#multilingual#evaluation#benchmarkRead on arxiv →
arxivApr 10bullish

The Art of Building Verifiers for Computer Use Agents

arXiv:2604.06240v1 Announce Type: cross Abstract: Verifying the success of computer use agent (CUA) trajectories is a critical challenge: without reliable verification, neither evaluation nor training signal can be trusted. In this paper, we present lessons learned from building a best-in-class veri

UNWEWE3 models#verification#evaluation#artificial-intelligenceRead on arxiv →
arxivApr 10bullish

Information as Structural Alignment: A Dynamical Theory of Continual Learning

arXiv:2604.07108v1 Announce Type: cross Abstract: Catastrophic forgetting is not an engineering failure. It is a mathematical consequence of storing knowledge as global parameter superposition. Existing methods, such as regularization, replay, and frozen subnetworks, add external mechanisms to a sha

STGO2 models#continual-learning#catastrophic-forgetting#machine-learningRead on arxiv →
arxivApr 7bullish

Agentization of Digital Assets for the Agentic Web: Concepts, Techniques, and Benchmark

arXiv:2604.04226v1 Announce Type: cross Abstract: Agentic Web, as a new paradigm that redefines the internet through autonomous, goal-driven interactions, plays an important role in group intelligence. As the foundational semantic primitives of the Agentic Web, digital assets encapsulate interactive

#multiagent#artificial-intelligence#benchmarkRead on arxiv →
arxivApr 3bullish

Prompt-Guided Prefiltering for VLM Image Compression

arXiv:2604.00314v1 Announce Type: cross Abstract: The rapid progress of large Vision-Language Models (VLMs) has enabled a wide range of applications, such as image understanding and Visual Question Answering (VQA). Query images are often uploaded to the cloud, where VLMs are typically hosted, hence

#image-compression#vision-language#efficiencyRead on arxiv →
HomeModelsNews