    Optimizing cloud economics with linear elastic caching

    By Admin, June 26, 2026

    Testing linear elastic caching

    To ensure our theory holds up in the real world, we conducted extensive experiments using two primary sources:

    1. Production workloads: We integrated the system into Spanner.
    2. Public traces: We tested against a variety of publicly available cache traces from industry benchmarks to ensure the results weren’t specific to Google’s infrastructure.

    Production workloads

    We developed a practical algorithm that assigns a time-to-live (TTL) to the cached page on each page request, based on the page’s access patterns and costs. Because Spanner handles billions of requests per second, this TTL prediction model has to be extremely lightweight. We opted for a shallow decision tree that can be translated into a few lines of C++ code. The resulting code is also easy to interpret and provides valuable insights into the workload characteristics. To predict the optimal TTL for each page, the model considers features such as the size of the data, the cost of a cache miss (when data isn’t in the cache and the system must retrieve it from a slower system, such as a disk), and the type of database operation being performed.
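
To make the idea of a shallow, interpretable TTL model concrete, here is a minimal Python sketch of a hand-written decision tree over the three kinds of features mentioned above. It is not the production model: the `PageRequest` type, the operation labels, and every threshold and TTL value are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PageRequest:
    """Features of one page request (hypothetical names and units)."""
    size_bytes: int   # size of the cached page
    miss_cost: float  # cost of re-fetching the page after a miss, arbitrary units
    op_type: str      # kind of database operation, e.g. "read" or "scan"


SMALL_PAGE_BYTES = 64 * 1024   # illustrative threshold, not from the source
EXPENSIVE_MISS = 10.0          # illustrative threshold, not from the source


def predict_ttl_seconds(req: PageRequest) -> float:
    """Shallow decision tree: at most three comparisons per request."""
    if req.op_type == "scan":
        # Assumption: pages touched by scans are rarely re-read soon.
        return 1.0
    if req.miss_cost >= EXPENSIVE_MISS:
        # Expensive misses justify holding the page longer,
        # and small pages are cheaper to hold than large ones.
        return 600.0 if req.size_bytes <= SMALL_PAGE_BYTES else 120.0
    # Cheap misses: keep the page only briefly.
    return 30.0 if req.size_bytes <= SMALL_PAGE_BYTES else 5.0
```

Because the tree is a handful of nested comparisons, each prediction costs only a few branches, and the rules can be read directly to see which features drive long or short TTLs.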

    We integrated the elastic caching policy into Spanner’s production servers over several months. Compared to a standard fixed-size cache, the results were substantial:

    • Memory usage: Reduced by 15.5%.
    • Cache misses: Increased by only 5.5%.
    • Total cost of ownership (TCO): Reduced by approximately 5%.

    Crucially, because the algorithm is “cost-aware,” the small increase in cache misses was concentrated on data that is cheap to fetch from storage, meaning the impact on actual I/O costs was a negligible 0.5%.
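
The gap between miss count and miss cost can be illustrated with a small worked example. The workload below is entirely made up (two page classes with invented miss counts and per-miss costs); it is not Spanner's actual breakdown, and only shows how extra misses that land on cheap pages barely move the total cost.

```python
def relative_increase(before: float, after: float) -> float:
    """Fractional growth from `before` to `after`."""
    return (after - before) / before


# Hypothetical workload: class -> (number of misses, cost per miss).
baseline = {"cheap": (1000, 1.0), "expensive": (1000, 20.0)}
# Under the cost-aware policy, all extra misses fall on the cheap class.
elastic = {"cheap": (1110, 1.0), "expensive": (1000, 20.0)}


def total_misses(workload: dict) -> int:
    return sum(count for count, _ in workload.values())


def total_miss_cost(workload: dict) -> float:
    return sum(count * cost for count, cost in workload.values())


miss_growth = relative_increase(total_misses(baseline), total_misses(elastic))
cost_growth = relative_increase(total_miss_cost(baseline), total_miss_cost(elastic))

print(f"misses: +{miss_growth:.1%}, miss cost: +{cost_growth:.2%}")
```

In this toy setup the miss count grows by 5.5% while the cost-weighted total grows by roughly half a percent, because each added miss is twenty times cheaper than a miss on the expensive class.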

    Public traces

    We also evaluated our elastic caching approach using several publicly available cache traces. As the fixed-size cache baseline, we used an optimized implementation of the greedy dual size frequency (GDSF) eviction algorithm, a generalization of the well-known LRU policy that allows for pages of different sizes.
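
For readers unfamiliar with GDSF, the following is a minimal, unoptimized Python sketch of the textbook policy, not the implementation used in the experiments. Each page gets priority `L + frequency * cost / size`, the lowest-priority page is evicted first, and the inflation value `L` is raised to the priority of each evicted page. The class and method names are illustrative.

```python
import heapq
from typing import Dict, List, Tuple


class GDSFCache:
    """Fixed-capacity cache with Greedy-Dual-Size-Frequency eviction."""

    def __init__(self, capacity_bytes: int) -> None:
        self.capacity = capacity_bytes
        self.used = 0
        self.inflation = 0.0  # "L": priority of the most recently evicted page
        # key -> (size, cost, frequency, sequence number of the live heap entry)
        self.entries: Dict[str, Tuple[int, float, int, int]] = {}
        # (priority, sequence number, key); may contain stale entries
        self.heap: List[Tuple[float, int, str]] = []
        self.seq = 0

    def _push(self, key: str, size: int, cost: float, freq: int) -> None:
        self.seq += 1
        priority = self.inflation + freq * cost / size
        self.entries[key] = (size, cost, freq, self.seq)
        heapq.heappush(self.heap, (priority, self.seq, key))

    def _evict_one(self) -> None:
        while True:
            priority, seq, key = heapq.heappop(self.heap)
            entry = self.entries.get(key)
            if entry is not None and entry[3] == seq:  # skip stale heap entries
                self.inflation = priority
                self.used -= entry[0]
                del self.entries[key]
                return

    def access(self, key: str, size: int, cost: float) -> bool:
        """Record one request; return True on a hit, False on a miss."""
        if key in self.entries:
            old_size, old_cost, freq, _ = self.entries[key]
            self._push(key, old_size, old_cost, freq + 1)
            return True
        if size > self.capacity:
            return False  # page can never fit, so it is not cached
        while self.used + size > self.capacity:
            self._evict_one()
        self.used += size
        self._push(key, size, cost, 1)
        return False
```

With equal sizes, equal costs, and frequency ignored, the ordering reduces to recency, which is why GDSF is described as a generalization of LRU. When costs differ, a cheap page can be evicted ahead of an expensive one even if the cheap page was touched more recently.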

    We considered four variants of elastic caching, depending on which ski rental algorithm we used and whether or not we used a machine-learned model. Since the public traces don’t include application-level features for training, we didn’t implement decision trees for prediction. Instead, we developed a simple learning strategy that splits each trace in half and uses the first half for training. For each individual page in the training trace, we computed the TTL that minimizes that page’s cost over the training trace.
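
The per-page training step can be sketched as follows. The cost model here is an assumption, not taken from the source: holding a page costs `hold_rate` per second, a miss costs `miss_cost`, the TTL restarts on every access, and the same two parameters apply to every page. Under that model the best TTL is either zero or one of the observed gaps between consecutive accesses, so it is enough to try those candidates.

```python
from typing import Dict, List, Sequence, Tuple


def trace_cost(gaps: Sequence[float], ttl: float,
               hold_rate: float, miss_cost: float) -> float:
    """Cost of one page's accesses under a fixed TTL (assumed cost model)."""
    cost = 0.0
    for gap in gaps:
        if gap <= ttl:
            cost += hold_rate * gap              # page survived: pay for holding it
        else:
            cost += hold_rate * ttl + miss_cost  # page expired: hold, then miss
    return cost  # the hold after the final access is ignored for simplicity


def best_ttl(access_times: Sequence[float],
             hold_rate: float, miss_cost: float) -> float:
    """TTL that minimizes the cost over this page's training accesses."""
    gaps = [b - a for a, b in zip(access_times, access_times[1:])]
    candidates = [0.0] + sorted(set(gaps))
    return min(candidates,
               key=lambda ttl: trace_cost(gaps, ttl, hold_rate, miss_cost))


def train_ttls(trace: Sequence[Tuple[float, str]],
               hold_rate: float, miss_cost: float) -> Dict[str, float]:
    """Learn one TTL per page from the first half of a (timestamp, page) trace."""
    training_half = trace[: len(trace) // 2]
    times: Dict[str, List[float]] = {}
    for timestamp, page in training_half:
        times.setdefault(page, []).append(timestamp)
    return {page: best_ttl(ts, hold_rate, miss_cost)
            for page, ts in times.items()}
```

For example, a page accessed at times 0, 1, 2, and 100 with `hold_rate=1.0` and `miss_cost=10.0` gets a TTL of 1: it is worth holding across the short gaps but cheaper to drop and re-fetch across the long one.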

    Since the behavior of the cache changes depending on what’s initially in the cache, a common practice, known as “warming up”, is to use some prefix of the cache trace to populate the cache but not actually measure performance on it. We warmed up all caches with one day’s worth of requests from the second half of the trace and used the rest for testing and measurements. During the test trace, if we encountered a page that was seen during training, we set the TTL to be the best precomputed TTL for that page. Otherwise, we set the TTL using either the breakeven or randomized policies.
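
The test-time TTL choice described above can be sketched like this. The source does not spell out the two fallback policies, so this sketch assumes the classic ski rental formulations: the breakeven policy holds a page until the holding cost equals the miss cost, and the randomized policy scales that breakeven time by a random fraction drawn from the density e^x / (e - 1) on [0, 1]. The function names, `hold_rate`, and `miss_cost` are illustrative.

```python
import math
import random
from typing import Dict, Optional


def breakeven_ttl(hold_rate: float, miss_cost: float) -> float:
    """Hold the page until the holding cost equals the cost of one miss."""
    return miss_cost / hold_rate


def randomized_ttl(hold_rate: float, miss_cost: float,
                   rng: random.Random) -> float:
    """Random fraction of the breakeven TTL (classic randomized ski rental)."""
    u = rng.random()
    # Inverse CDF of the density e^x / (e - 1) on [0, 1].
    fraction = math.log(1.0 + u * (math.e - 1.0))
    return fraction * breakeven_ttl(hold_rate, miss_cost)


def choose_ttl(page: str, learned: Dict[str, float],
               hold_rate: float, miss_cost: float,
               rng: Optional[random.Random] = None) -> float:
    """Use the learned TTL for known pages, else fall back to ski rental."""
    if page in learned:
        return learned[page]
    if rng is None:
        return breakeven_ttl(hold_rate, miss_cost)
    return randomized_ttl(hold_rate, miss_cost, rng)
```

Passing an `rng` selects the randomized fallback, and omitting it selects the breakeven fallback; combined with the choice of using or ignoring the learned table, this gives four configurations analogous to the four variants in the evaluation.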


