Close Menu
geekfence.comgeekfence.com
    What's Hot

    Investigation Finds Donut Lab Made False Claims About Revolutionary Battery Tech

    June 10, 2026

    PLDT preps $400M data center listing

    June 10, 2026

    Should HR Professionals Invest in AI and Automation Skills?

    June 10, 2026
    Facebook X (Twitter) Instagram
    • About Us
    • Contact Us
    Facebook Instagram
    geekfence.comgeekfence.com
    • Home
    • UK Tech News
    • AI
    • Big Data
    • Cyber Security
      • Cloud Computing
      • iOS Development
    • IoT
    • Mobile
    • Software
      • Software Development
      • Software Engineering
    • Technology
      • Green Technology
      • Nanotechnology
    • Telecom
    geekfence.comgeekfence.com
    Home»Big Data»Announcing the Databricks storage ecosystem: Governing the enterprise data estate, wherever it lives
    Big Data

    Announcing the Databricks storage ecosystem: Governing the enterprise data estate, wherever it lives

    AdminBy AdminJune 10, 2026No Comments9 Mins Read0 Views
    Facebook Twitter Pinterest LinkedIn Telegram Tumblr Email
    Announcing the Databricks storage ecosystem: Governing the enterprise data estate, wherever it lives
    Share
    Facebook Twitter LinkedIn Pinterest Email


    The Data That Can’t Move

    For years, the enterprise data strategy was simple: move everything to the cloud. Migrate the data lakes and the warehouses to the cloud, and then governance follows. It was a clean story — until it wasn’t.

    Today, some of the world’s most sophisticated enterprises are telling us clearly: they cannot — and will not — move all of their data to the cloud. Leading semiconductor manufacturers are training models on engineering-classified datasets that must never leave their premises. Global trading firms sit on massive volumes of historical tick data where the economics of cloud egress make migration impossible. Tier-1 banks have adopted “Hybrid Forever” strategies, modernizing on-premises storage while maintaining strict data sovereignty. Major pharmaceutical companies run millions of daily drug experiments against petabyte-scale on-premises data estates subject to stringent regulatory controls.

    These aren’t edge cases. They represent a structural shift in how enterprises think about data: from “Migrate Everything” to “Govern Everything.”

    The drivers are real and compounding:

    • Data sovereignty & regulation: Financial services, healthcare, and government organizations operate under mandates — GDPR, HIPAA, NIS2, sector-specific data residency rules — that require data to remain within specific jurisdictions or air-gapped environments. Cloud migration is not optional; it is legally prohibited for certain datasets.
    • Data gravity & costs: At petabyte and exabyte scale, the economics of cloud migration break down entirely. Egress fees, storage costs, and sheer data volume make the “move it once” model financially unsustainable. Some of the world’s largest retailers are actively repatriating analytics workloads from cloud back to on-premises infrastructure for precisely this reason.
    • Latency & edge workloads: Retail, manufacturing, and telco workloads require low-latency access to on-premises and edge data. Telecommunications providers ingest enormous volumes of network telemetry on-premises daily to power AI-driven network operations that cannot tolerate cloud round-trips.
    • AI on dark data: Vast stores of backup data, unstructured archives, and secondary datasets — representing hundreds of exabytes across the enterprise — contain immense AI value that has never been unlocked because governance didn’t reach it.

    The signal is unmistakable. We have received requests from hundreds of customers explicitly requesting on-premises and hybrid storage connectivity to Unity Catalog. The Software-Defined Storage (SDS) market stands at hundreds of billions of dollars in 2026, and the enterprise partners who manage this estate — collectively holding more than 2 Zettabytes of data under management — are building with us.

    Introducing the Databricks Storage Ecosystem

    Today, we are excited to announce the Databricks Software-Defined Storage (SDS) Ecosystem — a new partner category purpose-built to bring Databricks Intelligence Platform to enterprise data wherever it lives: on-premises, in private clouds, and at the edge environments. If you are an enterprise running petabytes of data on these platforms today, you no longer have to choose between your existing non-cloud storage infrastructure and Databricks AI.

    For too long, enterprises had to choose between the on-premises storage infrastructure they rely on and the cloud-native AI they want to build. Forcing customers to migrate massive amounts of data using complex pipelines just to unlock that intelligence is a broken model. By uniting these industry-leading partners, we are ending that compromise and delivering Databricks Intelligence directly to where the enterprise data lives. But this launch is just day one. We are building the foundation to ensure that soon, every piece of hybrid data–structured or unstructured–is instantly ready for generative AI without ever copying a byte. — Stephen Orban, SVP, Product Partnerships & Ecosystem, Databricks

    At the heart of this ecosystem is OpenSharing, an open-source protocol for secure, governed data sharing. Our storage partners are implementing OpenSharing servers to expose their data estates directly to Databricks Serverless Compute. The path is simple: the storage partner stands up a OpenSharing endpoint, you connect it to Unity Catalog, and you instantly gain secure, governed access to your on-premise data in Databricks without data migration.

    This integration provides a single, unified catalog across your entire hybrid environment. Customers can now use Databricks Serverless Compute, Genie, AgentBricks, and model training to query and reason over data that never leaves the premises. The result? Zero data movement, no duplication of data and zero compliance risk.

    This is not a roadmap aspiration. Customers can try these integrations today. Partners building these integrations follow the Partner Well-Architected Framework — a technical blueprint covering architecture, security, and certification criteria.

    Customers want to break down data silos and unify all of their Data and AI estate – including large amounts of data that still sits on-premises. Thanks to on-premises storage partners leveraging the open source Open Sharing protocol, customers can now seamlessly unify, govern, and analyze all of their data estate in Databricks Unity Catalog – unlocking the full value of their data in the Databricks Data Intelligence Platform. — Jonathan Keller, VP, Product Management, Databricks

    OpenSharing diagram

    Our Launch Partners

    We are proud to announce integrations with the following leading storage providers:

    Databricks Storage Ecosystem

    MinIO — General Availability (demo, blog)

    MinIO AIStor is the bridge that seamlessly connects the Databricks Data Intelligence Platform with enterprise data that can’t move to the cloud. By natively implementing the open Open Sharing protocol at the storage layer, AIStor eliminates complexity and enables Databricks customers to efficiently query live on-premises Apache Iceberg™️ and Delta tables under full Unity Catalog governance. It extends Serverless Compute, Genie, and Agent Bricks to on-premises data, bringing the full power of the Databricks Platform to an enterprise’s most critical data.

    AI and analytics initiatives are often constrained by where data resides, particularly in environments with strict security, sovereignty, or operational requirements. By bringing native OpenSharing to AIStor, we’re enabling organizations to securely expose data where it lives while giving Databricks seamless access through open standards. This removes a major barrier between enterprise data and AI, allowing organizations to activate previously inaccessible data for AI, analytics, and agentic applications without compromising control. — Ugur Tigli, Chief Technology Officer, MinIO

    Everpure (formerly Pure Storage) — Private Preview (demo, blog)

    Everpure and Databricks enable organizations to use on-prem data directly in the cloud removing the need for data replication or duplication.This is delivered through an OpenSharing connector that bridges data in object storage with databricks core workspaces in a secure and gated manner.

    Everpure and Databricks enable organizations to access and analyze on-premises data directly from the cloud without the need for replication or duplication. Continuously moving data between environments is costly and unsustainable at scale. Customers are looking for a simpler approach that balances cost, compliance, and data sovereignty while reducing operational complexity. — Chadd Kenney, VP of Product Management, Everpure

    Qumulo — Private Preview in July 2026 (blog)

    Qumulo has integrated OpenSharing with its new NeuralSearch, allowing customers to securely share Qumulo-stored data with Databricks across core, cloud, and edge environments—without replication, extra costs, or complexity. Using NeuralSearch, users can discover relevant datasets, including unstructured content, via natural-language queries and seamlessly share those curated tables with Databricks via OpenSharing.

    Organizations can no longer afford the cost, complexity, and delays of copying massive datasets across environments just to support AI and analytics. By combining Qumulo NeuralSearch with Databricks OpenSharing, customers can securely discover, govern, and share both tabular and unstructured data across core data centers, edge locations, and public clouds – in real time, without moving the data itself. Together, we’re helping organizations accelerate AI initiatives, unify governance, and unlock faster time-to-insights from globally distributed data while maintaining a single source of truth. — Brandon Whitelaw, SVP and Head of Product at Qumulo

    VAST Data — Private Preview in August 2026

    VAST Data is extending the VAST AI Operating System with OpenSharing support to help enterprises bridge Databricks workflows with data that resides across on-premises and hybrid infrastructure – without requiring massive data movement or migration. The integration will give customers more flexibility to access, process and operationalize data across cloud, data center and emerging AI infrastructure environments while supporting modern hybrid AI and analytics workloads.

    AI infrastructure is becoming fundamentally hybrid. Customers increasingly want the ability to process data wherever it makes the most sense economically and operationally, while still maintaining seamless access across environments. OpenSharing support extends the VAST AI Operating System’s ability to bridge Databricks workflows with data that resides across cloud and on-premises infrastructure for modern AI and analytics applications. Unlike traditional storage platforms, VAST combines data services, distributed processing and AI infrastructure orchestration into a unified operating system for AI data at scale. — John Mao, Vice President, Global Technology Alliances at VAST Data

    What’s Next

    Integrations Coming Soon

    In addition to our launch partners, momentum across the storage ecosystem continues to accelerate. We have secured commitments from Cohesity, Commvault, HPE, NetApp, Nutanix, and Rubrik —to build native integrations by the end of the year.

    Collectively, these partners, along with launch partners, manage hundreds of exabytes of enterprise data, spanning high-performance unstructured media, secondary backup archives, cost-effective cloud storage, and hyperconverged private cloud estates. 

    Unlocking Unstructured Data

    Today’s launch establishes structured, tabular data as fully governed and accessible across this ecosystem. But we know that exciting opportunity lies ahead in unstructured data: the images, PDFs, videos, medical scans, engineering simulations, and backup archives that represent the majority of enterprise data under management — and the raw material for the next generation of RAG pipelines and fine-tuned models.

    We are actively working to extend the OpenSharing protocol with Volumes APIs — exposing unstructured files from on-premises storage directly to Databricks for GenAI workloads. With this coming, partners managing massive unstructured estates — from media and imaging archives to enterprise backup repositories — will unlock an entirely new class of AI use cases for their customers.

    This is what it means to govern everything.

    Join the Ecosystem

    If you are a storage vendor interested in building an OpenSharing integration, visit the Partner Well Architected Framework or reach out to the Databricks Partner team to get started.

    If you are an enterprise customer who wants to connect your on-premises storage estate to Databricks, contact your account team to learn more.

    The era of “Migrate Everything” is over. The era of “Govern Everything” starts today.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

    Related Posts

    Three Ways Big Data Has Changed the World of SEO

    June 9, 2026

    GitHub Copilot Just Got Expensive for the Users Who Used It Most |

    June 8, 2026

    Google’s Open-Source Multimodal AI Explained

    June 7, 2026

    How Regulated Industries Ensure Compliant, Unified Omnichannel Communications

    June 6, 2026

    Schedule notebook runs in Amazon SageMaker Unified Studio

    June 4, 2026

    Scaling Enterprise Conversational Intelligence: Cross-industry Technology and Functional Solutions Powered by Databricks Genie

    June 3, 2026
    Top Posts

    Understanding U-Net Architecture in Deep Learning

    November 25, 202550 Views

    Hard-braking events as indicators of road segment crash risk

    January 14, 202630 Views

    Redefining AI efficiency with extreme compression

    March 25, 202627 Views
    Don't Miss

    Investigation Finds Donut Lab Made False Claims About Revolutionary Battery Tech

    June 10, 2026

    In January, there was a lot of hype around an announcement from Finnish company Donut…

    PLDT preps $400M data center listing

    June 10, 2026

    Should HR Professionals Invest in AI and Automation Skills?

    June 10, 2026

    Announcing the Databricks storage ecosystem: Governing the enterprise data estate, wherever it lives

    June 10, 2026
    Stay In Touch
    • Facebook
    • Instagram
    About Us

    At GeekFence, we are a team of tech-enthusiasts, industry watchers and content creators who believe that technology isn’t just about gadgets—it’s about how innovation transforms our lives, work and society. We’ve come together to build a place where readers, thinkers and industry insiders can converge to explore what’s next in tech.

    Our Picks

    Investigation Finds Donut Lab Made False Claims About Revolutionary Battery Tech

    June 10, 2026

    PLDT preps $400M data center listing

    June 10, 2026

    Subscribe to Updates

    Please enable JavaScript in your browser to complete this form.
    Loading
    • About Us
    • Contact Us
    • Disclaimer
    • Privacy Policy
    • Terms and Conditions
    © 2026 Geekfence.All Rigt Reserved.

    Type above and press Enter to search. Press Esc to cancel.