AI Chat
AI Chat builds on AI Describer but also lets users ask questions about their current view, past views, and nearby geography. The chat agent uses Google’s Multimodal Live API, which supports real-time interaction and function calling, and retains memory of all interactions within a single session. With each pan or movement, we send the user’s current view along with geographic context (e.g., nearby places, current heading).
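A minimal sketch of the per-step context message described above. The field names here are hypothetical placeholders for illustration, not the actual Multimodal Live API schema:

```python
import json

# Illustrative context sent with each pan/movement interaction.
# All field names are assumptions, not the real API schema.
step_context = {
    "view_image": "<base64 panorama crop>",          # user's current view
    "heading_deg": 270,                              # current heading
    "location": {"lat": 47.6062, "lon": -122.3321},  # current position
    "nearby_places": ["bus stop", "cafe", "crosswalk"],
}

# Serialized and sent alongside the user's message each step.
message = json.dumps(step_context)
print(message)
```

Because a fresh copy of this context accompanies every step, the model never has to infer the user's position; it only has to relate the question to context it has already seen.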
What makes AI Chat so powerful is its ability to hold a temporary “memory” of the user’s session: the context window allows up to 1,048,576 input tokens, roughly equivalent to more than 4,000 input images. Because AI Chat receives the user’s view and location with every virtual step, it accumulates spatial context as the user moves. A user can virtually walk past a bus stop, turn a corner, and then ask, “Wait, where was that bus stop?” The agent can recall its earlier context, analyze the current geographic input, and answer, “The bus stop is behind you, approximately 12 meters away.”
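The bus-stop recall above can be sketched as a lookup over accumulated session context. This is an illustrative simulation, not the actual agent: the history entries, landmark labels, and `recall_landmark` helper are all hypothetical, and the real system relies on the model's in-context memory rather than an explicit search.

```python
import math

def haversine_m(lat1, lon1, lat2, lon2):
    # Great-circle distance in meters between two lat/lon points.
    r = 6371000.0
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dp = math.radians(lat2 - lat1)
    dl = math.radians(lon2 - lon1)
    a = math.sin(dp / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dl / 2) ** 2
    return 2 * r * math.asin(math.sqrt(a))

# Simulated session "memory": one entry per virtual step, mirroring the
# per-step context the live session has already received (hypothetical data).
session_history = [
    {"step": 1, "lat": 47.60620, "lon": -122.33200, "landmarks": ["cafe"]},
    {"step": 2, "lat": 47.60625, "lon": -122.33195, "landmarks": ["bus stop"]},
    {"step": 3, "lat": 47.60632, "lon": -122.33188, "landmarks": []},
]

def recall_landmark(history, name):
    # Find the most recent step whose view contained the landmark,
    # then report its distance from the user's current position.
    current = history[-1]
    for entry in reversed(history):
        if name in entry["landmarks"]:
            d = haversine_m(current["lat"], current["lon"],
                            entry["lat"], entry["lon"])
            return f"The {name} is about {d:.0f} meters away."
    return f"No {name} seen this session."

print(recall_landmark(session_history, "bus stop"))
```

In the deployed system this retrieval happens implicitly: every prior view and location already sits in the model's context window, so answering only requires the model to relate the question to tokens it has already processed.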