Close Menu
geekfence.comgeekfence.com
    What's Hot

    Honolulu gambling raid in Waimakua Place nets machines

    June 14, 2026

    Expanding services and exploring NTN use at 900 MHz

    June 14, 2026

    Jinhua Zhao named head of the Department of Urban Studies and Planning | MIT News

    June 14, 2026
    Facebook X (Twitter) Instagram
    • About Us
    • Contact Us
    Facebook Instagram
    geekfence.comgeekfence.com
    • Home
    • UK Tech News
    • AI
    • Big Data
    • Cyber Security
      • Cloud Computing
      • iOS Development
    • IoT
    • Mobile
    • Software
      • Software Development
      • Software Engineering
    • Technology
      • Green Technology
      • Nanotechnology
    • Telecom
    geekfence.comgeekfence.com
    Home»Cloud Computing»Deloitte Japan Advances Security Operations with Cisco Foundation AI’s Open-Source Model
    Cloud Computing

    Deloitte Japan Advances Security Operations with Cisco Foundation AI’s Open-Source Model

    AdminBy AdminJune 14, 2026No Comments6 Mins Read0 Views
    Facebook Twitter Pinterest LinkedIn Telegram Tumblr Email
    Deloitte Japan Advances Security Operations with Cisco Foundation AI’s Open-Source Model
    Share
    Facebook Twitter LinkedIn Pinterest Email


    Introduction 

    We are excited to announce that Deloitte Japan is beginning production validation of Cisco Foundation AI’s Foundation-sec-1.1-8B-Instruct model for its security operations. By using this security-focused, open-source large language model (LLM), Deloitte Japan has automated key tasks such as security alert analysis, prioritization, and false positive reduction. This adoption highlights how open-source generative AI can enhance traditional security operations and offers practical insight into implementing purpose-driven workflows with cost-effective LLMs.  

    Background 

    As a managed security service provider, Deloitte Japan receives numerous security alerts from customer environments every day and must analyze and triage them. Some of these tasks are labor-intensive, such as analyzing raw alert logs and drafting summaries for each alert. Others require specific security knowledge and experience, like identifying false positives and creating suppression rules to prevent similar issues from recurring. 

    By implementing Cisco Foundation AI’s Foundation-sec-1.1-8B-Instruct model, Deloitte Japan has streamlined these tasks using workflows based on human analysts’ expertise. This approach accelerates alert triage and improves detection quality. Thanks to task-specific prompt tuning and workflow design, Deloitte Japan achieved stable and accurate results with the Foundation-sec-1.1-8B-Instruct model, matching the performance of models with over 15 times more parameters. 

    Based on this approach, Deloitte Japan is now introducing LLM-driven automation into the SOC workflow. The objective is not full automation of every analyst task, but practical automation of the most repetitive and time-consuming parts of alert handling. 



    Figure 1: SOC workflow and target areas for LLM-based automation.

    Workflows 

    Using the Foundation-sec-1.1-8B-Instruct model, Deloitte Japan developed three core workflows.

    1. Alert Analysis Support 

    This workflow supports analysts in alert analysis. It analyzes alerts handled by security analysts, assesses the impact of an attack, and provides the results along with the steps leading to the decision. 

    Figure 2: Agent workflow for alert analysis support. 

    As shown in Figure 2, the agent performs alert ingestion, targeted event collection, grounding, filtering/deduplication, enrichment, assessment, report generation, and follow-up guidance. 

    Specifically, it performs alert ingestion from SIEM; targeted event collection from IPS and EDR around the alert window; retrieval-augmented grounding against runbooks, prior cases, detection notes, and pre-attached threat intelligence or auxiliary logs; relevance filtering and deduplication; asset/user/context enrichment; severity and impact assessment; draft case-note/report generation; and follow-up guidance.  

    Figure 3: Example output of the analysis. 

    As shown in Figure 3, the output supports rationale, key evidence, uncertainty drivers, and an auditable step-by-step analysis trace. It also provides follow-up guidance (next actions and auto-closure criteria for clearly low-risk cases). The next steps are production validation and selective automation for well-bounded low-risk scenarios, with a human in the loop for anything ambiguous. 

    2. Alert Severity Analysis and Prioritization (Alert Triage)


    Figure 4: Agent workflow for alert severity analysis and prioritization
    .
     

    This workflow analyzes EDR alerts using alert details and related telemetry to support prioritization and identify likely false positives. As shown in Figure 4, the agent performs alert retrieval, event collection, relevance filtering, severity assessment, report drafting, and follow-up guidance.

    To improve output quality, the workflow uses surrounding EDR activity in addition to the alert itself, while controlling event scope to avoid excessive context. It also separates severity assessment, report drafting, and next-step guidance to reduce context drift and improve output stability.
    As shown in Figure 5, the output includes not only a severity label but also supporting rationale and uncertainty-related information that can guide analyst review. The next step is production validation and selective automation for clearly low-risk cases. The remaining challenge is robust evaluation of low-severity and false-positive scenarios. 

    Figure 5: Example output of the triage. 

    3. Alert Suppression Rule Creation based on False Positive Cases 

    In this workflow, the agent uses incident data recorded in tickets. Based on that data, it produces a suppression rule that suppresses only alerts linked to events determined to be false positives. It also outputs the reasoning behind the rule. When a false positive involves misuse of legitimate tools, such as Living off the Land attacks, the suppression rule needs to reflect how the tools were used. 

    Figure 6: Agent workflow for Alert Suppression Rule Creation based on False Positive Cases. 

    As shown in Figure 6, this workflow runs in several phases. To support accurate decisions, the process is broken down so that each task maps to a single node, and the graph structure enables branching based on each decision outcome. As shown in Figure 7, the workflow outputs the suppression rule. Rather than having the model generate the rule conditions directly, it first selects the necessary conditions from incident-related entities and then assembles them. This is intended to improve the consistency and reproducibility of the conditions and increase the success rate of assembling the rule. 

    Figure 7: Agent workflow for Alert Suppression Rule Creation based on False Positive Cases  

    These workflows can support security operations by providing summarized analysis for each alert, determining severity to identify critical or false positive cases, and generating effective suppression rules to filter out false positives in the future. With these outputs, security analysts can quickly understand the content of each alert. Severity scores help analysts focus on the most critical alerts. By applying suppression rules, analysts avoid being overwhelmed by insignificant alerts and can focus on what matters most.  

    Optimizations 

    The Foundation-sec-1.1-8B-Instruct model is a relatively small LLM with only 8 billion parameters, which keeps inference costs low and makes practical deployment easier. To match the performance of much larger models, Deloitte Japan applied several optimization techniques. 

    One effective technique was to break tasks into multiple steps within a workflow, rather than using a single, complex prompt. Workflows were designed based on human analysts’ experience, with steps such as extracting key information from alerts, reasoning over extracted values and patterns, and generating outputs based on previous steps. This allows the model to focus on each step with sufficient context and leverage organization-specific logic to ensure outputs are useful in production. 

    Another technique was to use structured outputs during intermediate steps. By specifying JSON-formatted output, the workflow can pass important information between steps more reliably, reduce ambiguity, and support smoother integration with downstream processing. 

    RAG is also used to improve the accuracy of the analysis. By using a combination of the security analyst’s analytical knowledge, monitored asset information, and historical response history, the agent can suggest actions more closely aligned with an analyst’s judgment.  

    Conclusion 

    The integration of Cisco Foundation AI’s Foundation-sec-1.1-8B-Instruct model into Deloitte Japan’s security operations marks a significant milestone in using open-source, security-focused AI models to accelerate and streamline security tasks. This helps reduce SOC analyst workload and improve productivity. We extend our sincere gratitude to the Deloitte Japan team for their outstanding implementation and for sharing the details of this use case. 

    Customer Testimonials

    “Through this PoV, Deloitte Japan confirmed that Cisco Foundation AI’s security-focused open-source model can support practical SOC automation, including alert analysis, prioritization, and false-positive reduction. By turning analyst expertise into structured workflows, we achieved explainable outputs with rationale and evidence. The results show that even an 8B model can deliver stable outcomes when combined with workflow design and structured outputs.” 

    — Kohei Sato, Partner, Head of Cyber Intelligence Center, Deloitte Tohmatsu Cyber LLC 



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

    Related Posts

    Now available: Amazon EC2 M9g and M9gd instances powered by new AWS Graviton5 processors

    June 12, 2026

    Apple’s iOS 27 Beta Fuels Foldable iPhone Rumors After WWDC 2026

    June 11, 2026

    NTT DATA expands Google Cloud work on Gemini Enterprise

    June 10, 2026

    Beware of the genAI token trap

    June 9, 2026

    Powering the AI-ready branch with agentic operations and quantum-era security

    June 8, 2026

    Cisco Customer Achievement Awards AMER 2026: Honoring Those Transforming IT

    June 7, 2026
    Top Posts

    Understanding U-Net Architecture in Deep Learning

    November 25, 202552 Views

    Hard-braking events as indicators of road segment crash risk

    January 14, 202630 Views

    Redefining AI efficiency with extreme compression

    March 25, 202627 Views
    Don't Miss

    Honolulu gambling raid in Waimakua Place nets machines

    June 14, 2026

    Honolulu police say a search warrant served at an alleged illegal gambling room on Waimakua…

    Expanding services and exploring NTN use at 900 MHz

    June 14, 2026

    Jinhua Zhao named head of the Department of Urban Studies and Planning | MIT News

    June 14, 2026

    How to Generate AI Videos using Gemini

    June 14, 2026
    Stay In Touch
    • Facebook
    • Instagram
    About Us

    At GeekFence, we are a team of tech-enthusiasts, industry watchers and content creators who believe that technology isn’t just about gadgets—it’s about how innovation transforms our lives, work and society. We’ve come together to build a place where readers, thinkers and industry insiders can converge to explore what’s next in tech.

    Our Picks

    Honolulu gambling raid in Waimakua Place nets machines

    June 14, 2026

    Expanding services and exploring NTN use at 900 MHz

    June 14, 2026

    Subscribe to Updates

    Please enable JavaScript in your browser to complete this form.
    Loading
    • About Us
    • Contact Us
    • Disclaimer
    • Privacy Policy
    • Terms and Conditions
    © 2026 Geekfence.All Rigt Reserved.

    Type above and press Enter to search. Press Esc to cancel.