AI Research Radar

Track new AI papers, benchmark papers, citations, arXiv categories, Semantic Scholar metadata, and related GitHub links.

How to use this dashboard

Use this radar to find fresh papers, research topics, author clusters, and related GitHub implementations worth reading next.

16 records
Date: 2026-04-28
Title: Recursive Multi-Agent Systems
Authors: Xiyuan Yang, Jiaru Zou, Rui Pan, Ruizhong Qiu
Categories: cs.AI, cs.CL, cs.LG
Topic: Agents
Signal: Agent trend signal
Abstract: Recursive or looped language models have recently emerged as a new scaling axis by iteratively refining the same model computation over latent states to deepen reasoning. We extend such scaling principle from …

Date: 2026-04-28
Title: DV-World: Benchmarking Data Visualization Agents in Real-World Scenarios
Authors: Jinxiang Meng, Shaoping Huang, Fangyu Lei, Jingyu Guo
Categories: cs.CL
Topic: Agents
Signal: Agent trend signal
Abstract: Real-world data visualization (DV) requires native environmental grounding, cross-platform evolution, and proactive intent alignment. Yet, existing benchmarks often suffer from code-sandbox confinement, single…

Date: 2026-04-28
Title: Make Any Collection Navigable: Methods for Constructing and Evaluating Hypergraph of Text
Authors: Dean E. Alvarez, ChengXiang Zhai
Categories: cs.IR
Topic: Evaluation
Signal: Benchmark/eval signal
Abstract: One reason the Web is more useful than a simple collection of documents is that the structure created by hyperlinks enables flexible navigation from one web page to another. However, hyperlinks are typically c…

Date: 2026-04-28
Title: Carbon-Taxed Transformers: A Green Compression Pipeline for Overgrown Language Models
Authors: Ajmain Inqiad Alam, Palash Roy, Chanchal K. Roy, Banani Roy
Categories: cs.SE, cs.LG
Topic: RAG / Retrieval
Signal: Research signal
Abstract: The accelerating adoption of Large Language Models (LLMs) in software engineering (SE) has brought with it a silent crisis: unsustainable computational cost. While these models demonstrate remarkable capabilit…

Date: 2026-04-28
Title: Pythia: Toward Predictability-Driven Agent-Native LLM Serving
Authors: Shan Yu, Junyi Shu, Yuanjiang Ni, Kun Qian
Categories: cs.MA, cs.DC, eess.SY
Topic: Agents
Signal: Agent trend signal
Abstract: As LLM applications grow more complex, developers are increasingly adopting multi-agent architectures to decompose workflows into specialized, collaborative components, introducing structure that constrains ag…

Date: 2026-04-28
Title: TSN-Affinity: Similarity-Driven Parameter Reuse for Continual Offline Reinforcement Learning
Authors: Dominik Żurek, Kamil Faber, Marcin Pietron, Paweł Gajewski
Categories: cs.LG, cs.AI
Topic: Evaluation
Signal: Benchmark/eval signal
Abstract: Continual offline reinforcement learning (CORL) aims to learn a sequence of tasks from datasets collected over time while preserving performance on previously learned tasks. This setting corresponds to domains…

Date: 2026-04-28
Title: Three Models of RLHF Annotation: Extension, Evidence, and Authority
Authors: Steve Coyne
Categories: cs.CY, cs.AI, cs.CL
Topic: Safety
Signal: Research signal
Abstract: Preference-based alignment methods, most prominently Reinforcement Learning with Human Feedback (RLHF), use the judgments of human annotators to shape large language model behaviour. However, the normative rol…

Date: 2026-04-28
Title: Conditional misalignment: common interventions can hide emergent misalignment behind contextual triggers
Authors: Jan Dubiński, Jan Betley, Anna Sztyber-Betley, Daniel Tan
Categories: cs.LG, cs.AI, cs.CR
Topic: Evaluation
Signal: Benchmark/eval signal
Abstract: Finetuning a language model can lead to emergent misalignment (EM) [Betley et al., 2025b]. Models trained on a narrow distribution of misaligned behavior generalize to more egregious behaviors when tested outs…

Date: 2026-04-28
Title: Observation-Guided Neural Surrogate Learning for Scientific Simulation Emulation: A Single-Gauge Flood-Inundation Proof of Concept
Authors: Marzieh Alireza Mirhoseini
Categories: physics.ao-ph
Topic: Evaluation
Signal: Benchmark/eval signal
Abstract: We present an observation-guided neural surrogate-learning framework for scientific simulation emulation, demonstrated on urban flood-inundation mapping. The framework combines LISFLOOD-FP hydrodynamic simulat…

Date: 2026-04-28
Title: No Pedestrian Left Behind: Real-Time Detection and Tracking of Vulnerable Road Users for Adaptive Traffic Signal Control
Authors: Anas Gamal Aly, Hala ElAarag
Categories: cs.CV, cs.AI, cs.RO
Topic: RAG / Retrieval
Signal: Research signal
Abstract: Current pedestrian crossing signals operate on fixed timing without adjustment to pedestrian behavior, which can leave vulnerable road users (VRUs) such as the elderly, disabled, or distracted pedestrians stra…

Date: 2026-04-28
Title: MarkIt: Training-Free Visual Markers for Precise Video Temporal Grounding
Authors: Pengcheng Fang, Yuxia Chen, Xiaohao Cai
Categories: cs.MM
Topic: RAG / Retrieval
Signal: Research signal
Abstract: Video temporal grounding (VTG) aims to localize the start and end timestamps of the event described by a given query within an untrimmed video. Despite the strong open-world video understanding and recognition…

Date: 2026-04-28
Title: Explainable AI for Jet Tagging: A Comparative Study of GNNExplainer, GNNShap, and GradCAM for Jet Tagging in the Lund Jet Plane
Authors: Pahal D. Patel, Sanmay Ganguly
Categories: hep-ph, cs.LG, hep-ex
Topic: RAG / Retrieval
Signal: Research signal
Abstract: Graph neural networks such as ParticleNet and transformer based networks on point clouds such as ParticleTransformer achieve state-of-the-art performance on jet tagging benchmarks at the Large Hadron Collider,…

Date: 2026-04-28
Title: QCalEval: Benchmarking Vision-Language Models for Quantum Calibration Plot Understanding
Authors: Shuxiang Cao, Zijian Zhang, Abhishek Agarwal, Grace Bratrud
Categories: quant-ph, cs.CV
Topic: RAG / Retrieval
Signal: Research signal
Abstract: Quantum computing calibration depends on interpreting experimental data, and calibration plots provide the most universal human-readable representation for this task, yet no systematic evaluation exists of how…

Date: 2026-04-28
Title: From Threads to Trajectories: A Multi-LLM Pipeline for Community Knowledge Extraction from GitHub Issue Discussions
Authors: Nazia Shehnaz Joynab, Soneya Binta Hossain
Categories: cs.SE
Topic: Agents
Signal: Agent trend signal
Abstract: Resolution of complex post-production issues in large-scale open-source software (OSS) projects requires significant cognitive effort, as developers need to go through long, unstructured and fragmented issue d…

Date: 2026-04-28
Title: When Errors Can Be Beneficial: A Categorization of Imperfect Rewards for Policy Gradient
Authors: Shuning Shang, Hubert Strauss, Stanley Wei, Sanjeev Arora
Categories: cs.LG, cs.AI, stat.ML
Topic: Evaluation
Signal: Benchmark/eval signal
Abstract: Training language models via reinforcement learning often relies on imperfect proxy rewards, since ground truth rewards that precisely define the intended behavior are rarely available. Standard metrics for as…

Date: 2026-04-28
Title: Twisted and Twisted Linearized Reed--Solomon Codes, LCD and ACD MDS constructions
Authors: Sanjit Bhowmick, Kuntal Deka, Edgar Martínez-Moro
Categories: cs.IT
Topic: Evaluation
Signal: Benchmark/eval signal
Abstract: We investigate a natural subfamily of twisted linearized Reed--Solomon (TLRS) codes in the sum-rank metric, where the twist is applied only to the constant term. We establish a simple necessary and sufficient …