google Jul 1 bullish

New York City educators and industry leaders gathered at Google’s offices to shape the future of AI in classrooms.

Google, the New York Jobs CEO Council and Urban Assembly hosted an AI summit for 150 education and industry leaders.

NOGO2 models #education #ai-literacy #industry-partnership

arxiv Jun 17 bullish

LLM-as-Judge in Education: A Curriculum-Grounded Marking Pipeline

arXiv:2606.17507v1 Announce Type: new Abstract: Generative AI and large language models (LLMs) are increasingly applied to question generation and automated assessment. However, deploying LLMs in preparation for high-stakes exams requires more than prompt engineering; it demands software pipelines t

LL1 model #education #assessment #language-models

openai Jun 12 bullish

New OpenAI Academy courses for the next era of work

OpenAI introduces three Academy courses that help people build practical AI skills, create repeatable workflows, and apply agents in everyday work.

#education #ai-skills #workforce

arxiv Jun 10 bearish

RealMath-Eval: Why SOTA Judges Struggle with Real Human Reasoning

arXiv:2606.10254v1 Announce Type: new Abstract: While Large Language Models (LLMs) have achieved near-perfect performance in solving high-school mathematics, their ability to evaluate the diverse reasoning processes of real human students remains under-examined. To bridge this gap, we

LA1 model #evaluation #benchmark #mathematics

arxiv Jun 5 bearish

Personality Shapes Gender Bias in Persona-Conditioned LLM Narratives Across English and Hindi: An Empirical Investigation

arXiv:2604.23600v2 Announce Type: replace Abstract: Large Language Models (LLMs) are increasingly deployed in persona-driven applications such as education, customer service, and social platforms, where models are prompted to adopt specific personas when interacting with users. While persona conditi

LL1 model #bias #language-models #stereotypes

arxiv May 29

Error as a Lens: Probing LLM Reasoning through Synthetic Misconception Generation

arXiv:2605.29007v1 Announce Type: new Abstract: Personalized tutoring, teacher training, and education research need access to targeted synthetic misconceptions, but privacy and IRB constraints make labelled corpora of real student errors scarce. LLMs could in principle generate synthetic err

LL1 model #education #synthetic-data #language-models

arxiv May 29

Slide Deck Q&A Quality Assurance App: A Multi-Stage Pipeline for Pedagogical Question Generation

arXiv:2605.26428v2 Announce Type: replace Abstract: Generating high-quality, pedagogically useful questions from lecture slide decks is difficult because important instructional content is distributed across both text and visual elements, and because useful questions must be scaffolded across the fl

#education #nlp #question-generation

arxiv May 28

Memory-Based vs. Context-Only Conditioning Produces Distinct Behavioral Patterns in Stateful Personalization

arXiv:2605.27389v1 Announce Type: cross Abstract: We study how conditioning context shapes personalization behavior in a teacher-facing educational recommender system. We compare contextual conditioning based on the current student question with memory-based conditioning using persistent learner inf

#personalization #education #recommender-systems

arxiv May 27

Are Video Models Zero-Shot Learners and Reasoners in Education? EduVideoBench, A Knowledge-Skills-Attitude Benchmark for Educational Video Generation

arXiv:2605.26918v1 Announce Type: new Abstract: Video generation models (VGMs) are rapidly entering classrooms, yet existing benchmarks evaluate only perceptual quality, intrinsic faithfulness, generic safety, or video as a reasoning medium, and none assesses whether the outputs are educationally va

#education #benchmark #video-generation

arxiv May 14 bullish

Teaching Language Models How to Code Like Learners: Conversational Serialization for Student Simulation

arXiv:2604.10720v2 Announce Type: replace Abstract: Artificial students -- models that simulate how learners act and respond within educational systems -- are a promising tool for evaluating tutoring strategies and feedback mechanisms at scale. However, most existing approaches rely on prompting lar

QW1 model #open-source #education #programming

arxiv May 7 bullish

AI Advocate: Educational Path to Transform Squads to the Future

arXiv:2605.03800v1 Announce Type: cross Abstract: This paper analyzes the strategic education process aimed at transitioning traditional software development squads into hybrid structures centered on collaborative work between humans and Artificial Intelligence (AI). In a context where human-AI coll

#collaboration #education #software-engineering

arxiv Apr 14 bullish

ACE-TA: An Agentic Teaching Assistant for Grounded Q&A, Quiz Generation, and Code Tutoring

arXiv:2604.09572v1 Announce Type: cross Abstract: We introduce ACE-TA, the Agentic Coding and Explanations Teaching Assistant framework, that autonomously routes conceptual queries drawn from programming course material to grounded Q&A, stepwise coding guidance, and automated quiz generation using p

LA1 model #education #programming #language-models

arxiv Apr 13 bullish

ConvoLearn: A Learning Sciences Grounded Dataset for Fine-Tuning Dialogic AI Tutors

arXiv:2601.08950v4 Announce Type: replace Abstract: Despite their growing adoption in education, LLMs remain misaligned with the core principle of effective tutoring: the dialogic construction of knowledge. We introduce ConvoLearn, a dataset of 2,134 semi-synthetic tutor-student dialogues operationa

MI1 model #education #dialogic #tutoring

arxiv Apr 11 bullish

Behavior-Aware Item Modeling via Dynamic Procedural Solution Representations for Knowledge Tracing

arXiv:2604.08260v1 Announce Type: new Abstract: Knowledge Tracing (KT) aims to predict learners' future performance from past interactions. While recent KT approaches have improved via learning item representations aligned with Knowledge Components, they overlook the procedural dynamics of problem s

BA1 model #education #knowledge-tracing #language-models

arxiv Apr 10

Chatbot-Based Assessment of Code Understanding in Automated Programming Assessment Systems

arXiv:2604.07304v1 Announce Type: cross Abstract: Large Language Models (LLMs) challenge conventional automated programming assessment because students can now produce functionally correct code without demonstrating corresponding understanding. This paper makes two contributions. First, it reports a

LA1 model #education #programming #assessment

arxiv Apr 10 bullish

ConvoLearn: A Dataset for Fine-Tuning Dialogic AI Tutors

arXiv:2601.08950v3 Announce Type: replace Abstract: Despite their growing adoption in education, LLMs remain misaligned with the core principle of effective tutoring: the dialogic construction of knowledge. We introduce ConvoLearn, a dataset of 2,134 semi-synthetic tutor-student dialogues operation

MI1 model #education #tutoring #dialogic

arxiv Apr 7 bullish

Sandpiper: Orchestrated AI-Annotation for Educational Discourse at Scale

arXiv:2603.08406v2 Announce Type: replace-cross Abstract: Digital educational environments are expanding toward complex AI and human discourse, providing researchers with an abundance of data that offers deep insights into learning and instructional processes. However, traditional qualitative analys

LA1 model #education #research #qualitative-analysis

arxiv Apr 7

Pedagogical Safety in Educational Reinforcement Learning: Formalizing and Detecting Reward Hacking in AI Tutoring Systems

arXiv:2604.04237v1 Announce Type: cross Abstract: Reinforcement learning (RL) is increasingly used to personalize instruction in intelligent tutoring systems, yet the field lacks a formal framework for defining and evaluating pedagogical safety. We introduce a four-layer model of pedagogical safety

#education #reinforcement-learning #safety