AI Signal · 2026-06-15

ARCHIVE · 10 SIGNALS Generated: 2026-06-15 15:43

Show HN: Pantheon – AI vs AI: one writes the code, the other attacks it

There's always a generous look at the code you've come up with. But the pantheon is different. The pantheon is made by turning multiple sub-agents, and when the scorer scores th...

Hacker News AI / 2026-06-15 14:20

#02

UP-NRPA: User Portrait based Nested Rollout Policy Adaptation for Planning with Large Language Models in Goal-oriented Dialogue Systems

arXiv:2606.13683v1 Announce Type: new Abstract: To address the challenge that current dialogue policy planning methods struggle to dynamically adapt to diverse user characterist...

arXiv AI / 2026-06-15 04:00

#03

Orchestra-o1: Omnimodal Agent Orchestration

arXiv:2606.13707v1 Announce Type: new Abstract: The recent success of agent swarms has shifted the paradigm of large language model (LLM)-based agents from single-agent workflow...

arXiv AI / 2026-06-15 04:00

#04

Hybrid Open-Ended Tri-Evolution Makes Better Deep Researcher

arXiv:2606.13710v1 Announce Type: new Abstract: Deep research and agent evolution serve as de-facto tasks for AI agents in real-world applications toward artificial general inte...

arXiv AI / 2026-06-15 04:00

#05

Benchmarking Web Agent Safety under E-commerce Deceptive Interfaces

arXiv:2606.13686v1 Announce Type: new Abstract: As autonomous web agents are increasingly deployed to perform real-world tasks, ensuring their safety has become a critical conce...

arXiv Computation and Language / 2026-06-15 04:00

#06

QIAS 2026: Overview of the Shared Task on Islamic Inheritance Reasoning

arXiv:2606.13756v1 Announce Type: new Abstract: This paper presents a comprehensive overview of the QIAS 2026 shared task, organized as part of the OSACT7 Workshop and co-locate...

arXiv Computation and Language / 2026-06-15 04:00

#07

The Culture Funnel: You Can't Align What isn't in the Data

arXiv:2606.13808v1 Announce Type: new Abstract: Current cultural alignment approaches focus on inference-time interventions, assuming models already contain sufficient cultural...

arXiv Computation and Language / 2026-06-15 04:00

#08

OpenAI frontier models and Codex are now available on AWS

OpenAI frontier models and Codex are now generally available on AWS, giving enterprises a new path to build with OpenAI through the AWS environments, controls, and procurement w...

OpenAI News / 2026-06-01 10:00

#09

Databricks brings GPT-5.5 to enterprise agent workflows

Databricks uses GPT-5.5 for enterprise agent workflows after the model set a new state of the art on the OfficeQA Pro benchmark.

OpenAI News / 2026-05-15 00:00

#10

Introducing GPT-5.4 mini and nano

GPT-5.4 mini and nano are smaller, faster versions of GPT-5.4 optimized for coding, tool use, multimodal reasoning, and high-volume API and sub-agent workloads.

OpenAI News / 2026-03-17 10:00