
From Turn 1 to Turn 10: How LLMs Get Lost In Multi-Turn Conversations
Real-world interactions between humans and LLMs are rarely single-shot. Instead, users start with vague requests and then iterate, clarify, and refine over multiple turns. Yet most LLM benchmarks assume a fully specified, single-turn setting, a poor match for how people actually chat. Prior analyses of conversation logs confirm that underspecification