Ramp Labs proposes a new approach to shared multi-agent memory, cutting token consumption by up to 65%

GateNews

Gate News message, April 11: AI infrastructure company Ramp Labs has released research called “Latent Briefing,” which lets agents in a multi-agent system share memory efficiently by directly compressing the large model's KV cache, sharply reducing token consumption without losing accuracy. In mainstream multi-agent architectures, an orchestrator breaks down tasks and repeatedly calls worker model instances; as the inference chain grows longer, token usage grows exponentially. The core idea behind Latent Briefing is to use the attention mechanism to identify the parts of the context that actually matter and to discard redundant information directly at the representation layer, rather than relying on slow LLM summarization or on RAG retrieval, whose results are less stable.

On the LongBench v2 benchmark, the worker model's token consumption dropped by up to 65%, the median token savings for medium-length documents (32k to 100k) reached 49%, and overall accuracy improved by about 3 percentage points over the baseline. Each compression added only about 1.7 seconds, roughly a 20x speedup compared with the original algorithm. The experiments used Claude Sonnet 4 as the orchestrator and Qwen3-14B as the worker model, and covered a wide range of document types, including academic papers, legal documents, novels, and government reports.

The study also found that the optimal compression threshold varies with task difficulty and document length: hard problems benefit from aggressive compression, which filters out speculative reasoning noise, while long documents benefit from lighter compression, which preserves key information scattered across the text.
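The general idea of attention-guided KV cache compression can be sketched as follows. This is a minimal illustration under stated assumptions, not Ramp Labs' implementation: the function names and the `keep_ratio` parameter are hypothetical, the cache is modeled as plain Python lists for a single attention head, and a real system would work on per-layer, per-head tensors. The sketch scores each cached position by the total attention it receives from a set of query vectors, then keeps only the highest-scoring positions.

```python
import math


def attention_weights(query, keys):
    """Softmax of scaled dot products between one query and every cached key."""
    d = len(query)
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(d) for key in keys]
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def compress_kv_cache(keys, values, queries, keep_ratio):
    """Keep the cache positions that receive the most attention mass.

    keep_ratio is a hypothetical knob (fraction of positions to retain);
    the source only says that the best compression threshold depends on
    task difficulty and document length.
    """
    n = len(keys)
    importance = [0.0] * n
    for query in queries:
        for i, w in enumerate(attention_weights(query, keys)):
            importance[i] += w
    n_keep = max(1, math.ceil(keep_ratio * n))
    ranked = sorted(range(n), key=lambda i: importance[i], reverse=True)
    kept = sorted(ranked[:n_keep])  # restore original token order
    return [keys[i] for i in kept], [values[i] for i in kept], kept


# Toy cache: four positions, two-dimensional keys, one-dimensional values.
keys = [[1.0, 0.0], [0.0, 1.0], [0.9, 0.1], [-1.0, 0.0]]
values = [[10.0], [20.0], [30.0], [40.0]]
queries = [[4.0, 0.0]]  # a query that attends mostly to positions 0 and 2

small_keys, small_values, kept = compress_kv_cache(keys, values, queries, keep_ratio=0.5)
print(kept)  # indices of the retained positions
```

In this toy setup, a lower `keep_ratio` corresponds to the more aggressive compression the study associates with hard problems, and a higher one to the lighter compression it associates with long documents.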

