Category: Blog

  • When PDFs Fight Back: Converting Complex Documents for Information Diffusion

    Or: How I learned to stop trusting automated tools and verify everything. The Experiment: Unlocking the Value in Long-Form PDFs. We’ve all been there: you have a comprehensive 200-page PDF (a research report, technical manual, or detailed analysis) packed with valuable information. But actually using it? Nearly impossible. No search that works well, no way to jump…

  • From ‘Catching Bad Words’ to ‘Understanding Bad Intent’: AI Safety’s Next Evolution

    As Large Language Models (LLMs) like Claude and GPT-4 become central to our digital lives, a silent arms race is happening behind the scenes. On one side, “jailbreakers” try to trick AI into bypassing its safety filters; on the other, researchers build shields to keep the AI helpful and harmless. The recent paper “Constitutional Classifiers++:…

  • From Months to Days: AI-Assisted Peer Review with Human Oversight

    The Breaking Point: Your research center issues a call for papers on a pressing global challenge. Within weeks, 500 submissions flood in, each representing months or years of scholarly work, each deserving careful consideration. Then reality hits. You have perhaps a dozen qualified reviewers, most already overcommitted. Traditional peer review would demand thousands of person-hours…

  • LLMBase: One API, Many AI Models

    I built a tool to make working with multiple AI models easier, and it’s now available on GitHub (https://github.com/ngstcf/llmbase). Dealing with API Fragmentation: Like many developers experimenting with AI, I found myself wanting to try different models for different tasks. GPT-4o is great for general-purpose work, Claude Sonnet handles complex reasoning well, Gemini shines with…