Close Menu
geekfence.comgeekfence.com
    What's Hot

    A new era of humanoid robots

    February 15, 2026

    LLaMA in R with Keras and TensorFlow

    February 15, 2026

    How Deutsche Börse Federates Data Governance with Control

    February 15, 2026
    Facebook X (Twitter) Instagram
    • About Us
    • Contact Us
    Facebook Instagram
    geekfence.comgeekfence.com
    • Home
    • UK Tech News
    • AI
    • Big Data
    • Cyber Security
      • Cloud Computing
      • iOS Development
    • IoT
    • Mobile
    • Software
      • Software Development
      • Software Engineering
    • Technology
      • Green Technology
      • Nanotechnology
    • Telecom
    geekfence.comgeekfence.com
    Home»Artificial Intelligence»Senior Director of Million-Dollar Regexes – O’Reilly
    Artificial Intelligence

    Senior Director of Million-Dollar Regexes – O’Reilly

    AdminBy AdminDecember 2, 2025No Comments2 Mins Read2 Views
    Facebook Twitter Pinterest LinkedIn Telegram Tumblr Email
    Senior Director of Million-Dollar Regexes – O’Reilly
    Share
    Facebook Twitter LinkedIn Pinterest Email



    The following article originally appeared on Medium and is being republished here with the author’s permission.

    Don’t get me wrong, I’m up all night using these tools.

    But I also sense we’re heading for an expensive hangover. The other day, a colleague told me about a new proposal to route a million documents a day through a system that identifies and removes Social Security numbers.

    I joked that this was going to be a “million-dollar regular expression.”

    Run the math on the “naïve” implementation with full GPT-5 and it’s eye-watering: A million messages a day at ~50K characters each works out to around 12.5 billion tokens daily, or $15,000 a day at current pricing. That’s nearly $6 million a year to check for Social Security numbers. Even if you migrate to GPT-5 Nano, you still spend about $230,000 a year.

    That’s a success. You “saved” $5.77 million a year…

    How about running this code for a million documents a day? How much would this cost:

    import re; s = re.sub(r”\b\d{3}[- ]?\d{2}[- ]?\d{4}\b”, “[REDACTED]”, s)

    A plain old EC2 instance could handle this… A single EC2 instance—something like an m1.small at 30 bucks a month—could churn through the same workload with a regex and cost you a few hundred dollars a year.

    Which means that in practice, companies will be calling people like me in a year saying, “We’re burning a million dollars to do something that should cost a fraction of that—can you fix it?”

    From $15,000/day to $0.96/day—I do think we’re about to see a lot of companies realize that a thinking model connected to an MCP server is way more expensive than just paying someone to write a bash script. Starting now, you’ll be able to make a career out of un-LLM-ifying applications.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

    Related Posts

    LLaMA in R with Keras and TensorFlow

    February 15, 2026

    ALS stole this musician’s voice. AI let him sing again.

    February 14, 2026

    The Future of Agentic Coding – O’Reilly

    February 13, 2026

    Maximizing throughput with time-varying capacity

    February 12, 2026

    80% of Fortune 500 use active AI Agents: Observability, governance, and security shape the new frontier

    February 11, 2026

    3 Questions: Using AI to help Olympic skaters land a quint | MIT News

    February 10, 2026
    Top Posts

    Hard-braking events as indicators of road segment crash risk

    January 14, 202617 Views

    Understanding U-Net Architecture in Deep Learning

    November 25, 202512 Views

    How to integrate a graph database into your RAG pipeline

    February 8, 20268 Views
    Don't Miss

    A new era of humanoid robots

    February 15, 2026

    Synopsys ramps up work around physical AI to help build adaptive machines capable of sensing…

    LLaMA in R with Keras and TensorFlow

    February 15, 2026

    How Deutsche Börse Federates Data Governance with Control

    February 15, 2026

    The data behind the design: How Pantone built agentic AI with an AI-ready database

    February 15, 2026
    Stay In Touch
    • Facebook
    • Instagram
    About Us

    At GeekFence, we are a team of tech-enthusiasts, industry watchers and content creators who believe that technology isn’t just about gadgets—it’s about how innovation transforms our lives, work and society. We’ve come together to build a place where readers, thinkers and industry insiders can converge to explore what’s next in tech.

    Our Picks

    A new era of humanoid robots

    February 15, 2026

    LLaMA in R with Keras and TensorFlow

    February 15, 2026

    Subscribe to Updates

    Please enable JavaScript in your browser to complete this form.
    Loading
    • About Us
    • Contact Us
    • Disclaimer
    • Privacy Policy
    • Terms and Conditions
    © 2026 Geekfence.All Rigt Reserved.

    Type above and press Enter to search. Press Esc to cancel.