Close Menu
geekfence.comgeekfence.com
    What's Hot

    Agentic AI is not the starting point, it’s the destination 

    May 11, 2026

    TX Fiber taps Vecima for XGS-PON roll out

    May 11, 2026

    From Planning to Action: SAP Enterprise Planning enhanced by DataRobot

    May 11, 2026
    Facebook X (Twitter) Instagram
    • About Us
    • Contact Us
    Facebook Instagram
    geekfence.comgeekfence.com
    • Home
    • UK Tech News
    • AI
    • Big Data
    • Cyber Security
      • Cloud Computing
      • iOS Development
    • IoT
    • Mobile
    • Software
      • Software Development
      • Software Engineering
    • Technology
      • Green Technology
      • Nanotechnology
    • Telecom
    geekfence.comgeekfence.com
    Home»Artificial Intelligence»Senior Director of Million-Dollar Regexes – O’Reilly
    Artificial Intelligence

    Senior Director of Million-Dollar Regexes – O’Reilly

    AdminBy AdminDecember 2, 2025No Comments2 Mins Read2 Views
    Facebook Twitter Pinterest LinkedIn Telegram Tumblr Email
    Senior Director of Million-Dollar Regexes – O’Reilly
    Share
    Facebook Twitter LinkedIn Pinterest Email



    The following article originally appeared on Medium and is being republished here with the author’s permission.

    Don’t get me wrong, I’m up all night using these tools.

    But I also sense we’re heading for an expensive hangover. The other day, a colleague told me about a new proposal to route a million documents a day through a system that identifies and removes Social Security numbers.

    I joked that this was going to be a “million-dollar regular expression.”

    Run the math on the “naïve” implementation with full GPT-5 and it’s eye-watering: A million messages a day at ~50K characters each works out to around 12.5 billion tokens daily, or $15,000 a day at current pricing. That’s nearly $6 million a year to check for Social Security numbers. Even if you migrate to GPT-5 Nano, you still spend about $230,000 a year.

    That’s a success. You “saved” $5.77 million a year…

    How about running this code for a million documents a day? How much would this cost:

    import re; s = re.sub(r”\b\d{3}[- ]?\d{2}[- ]?\d{4}\b”, “[REDACTED]”, s)

    A plain old EC2 instance could handle this… A single EC2 instance—something like an m1.small at 30 bucks a month—could churn through the same workload with a regex and cost you a few hundred dollars a year.

    Which means that in practice, companies will be calling people like me in a year saying, “We’re burning a million dollars to do something that should cost a fraction of that—can you fix it?”

    From $15,000/day to $0.96/day—I do think we’re about to see a lot of companies realize that a thinking model connected to an MCP server is way more expensive than just paying someone to write a bash script. Starting now, you’ll be able to make a career out of un-LLM-ifying applications.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

    Related Posts

    From Planning to Action: SAP Enterprise Planning enhanced by DataRobot

    May 11, 2026

    Best Executive Programs to Build AI Leadership Across Business, Marketing, and Technology 

    May 10, 2026

    Posit AI Blog: AO, NAO, ENSO: A wavelet analysis example

    May 9, 2026

    The Download: AI malaise and babymaking tech

    May 8, 2026

    The Best Risk Mitigation Strategy in Data? A Single Source of Truth – O’Reilly

    May 7, 2026

    Catalyzing scientific impact through global partnerships and open resources

    May 6, 2026
    Top Posts

    Understanding U-Net Architecture in Deep Learning

    November 25, 202539 Views

    Hard-braking events as indicators of road segment crash risk

    January 14, 202627 Views

    Redefining AI efficiency with extreme compression

    March 25, 202626 Views
    Don't Miss

    Agentic AI is not the starting point, it’s the destination 

    May 11, 2026

    I recently spoke with a solo-practitioner attorney who told me she had “implemented Artificial Intelligence (AI) everywhere.” Given how aggressively…

    TX Fiber taps Vecima for XGS-PON roll out

    May 11, 2026

    From Planning to Action: SAP Enterprise Planning enhanced by DataRobot

    May 11, 2026

    How to Fix Your Claim Denial Rate with Expert Outsourcing

    May 11, 2026
    Stay In Touch
    • Facebook
    • Instagram
    About Us

    At GeekFence, we are a team of tech-enthusiasts, industry watchers and content creators who believe that technology isn’t just about gadgets—it’s about how innovation transforms our lives, work and society. We’ve come together to build a place where readers, thinkers and industry insiders can converge to explore what’s next in tech.

    Our Picks

    Agentic AI is not the starting point, it’s the destination 

    May 11, 2026

    TX Fiber taps Vecima for XGS-PON roll out

    May 11, 2026

    Subscribe to Updates

    Please enable JavaScript in your browser to complete this form.
    Loading
    • About Us
    • Contact Us
    • Disclaimer
    • Privacy Policy
    • Terms and Conditions
    © 2026 Geekfence.All Rigt Reserved.

    Type above and press Enter to search. Press Esc to cancel.