News
AI Can Now Do Expert-Level Work (Almost). 5 Surprising Findings from a Landmark...
OpenAI has released a benchmark for evaluating AI on complex professional tasks. The results show that the best AI models are begi...
The Vulnerability of Autonomous Driving to Typographic Attacks: Transferability...
This article reviews and compares two major types of adversarial attacks against neural networks: gradient-based methods (like PGD...
LLMs + Vector Databases: Building Memory Architectures for AI Agents
The 128k token limit for GPT-4 is equivalent to about 96,000 words. This limitation becomes a major barrier for a research assista...
The Integration of Vision-LLMs into AD Systems: Capabilities and Challenges
This article reviews the development and application of Vision-Large-Language-Models, focusing on their integration into autonomou...
Gemini Might Be the ONLY Actual Foundational Model Out There
ChatGPT has been in beta for a year, but the latest updates have made it seem like a "genius being slowly lobotomized for public s...
The Future of Learning is Here: Google’s Learn Your Way Revolutionizes Textbooks...
Google’s “Learn Your Way,” now available on Google Labs, is a research experiment that leverages generative AI (GenAI) to transfor...
How People Use ChatGPT
A groundbreaking NBER Working Paper, “How People Use ChatGPT”, finally pulls back the curtain on this phenomenon. This comprehensi...
The Unseen Variable: Why Your LLM Gives Different Answers (and How We Can Fix It...
The same prompt, run multiple times, can produce entirely different outputs. This isn’t just a quirk of “probabilistic” AI; it’s a...
Generative AI: Is It Moving From Large Language Models to Small Languge Models?
Large language models (LLMs) have played a pivotal role in the significant growth witnessed by GenAI. But LLMs come with a number...
The Paradox of Brilliance: Why Our Smartest AI Still “Bluffs” And How We Can Tea...
OpenAI: Are our most advanced AI systems secretly bluffing? This isn’t a rhetorical question, but a critical challenge underpinnin...
Beyond the Hype: How Small Language Models and Knowledge Graphs are Redefining D...
The paper establishes the importance of a combination of Small Language Models (SLMs) with their smallness and modularity in contr...
From Headlines to Digests: How Agents Personalize the Firehose
From firehose to digest: how multi-agent systems, guided by MCP and grounded in fundamentals, can transform any feed into personal...
