#browser-based-ai tag

Web development

The duality of language models in the browser

Small language models running in browsers offer low-cost, private, accessible AI without per-token fees or backend requirements.

from DEV Community

2 months ago

I Built a 100% Private, On-Device AI Audio Stem Splitter (No Servers!)

If you've ever used tools like PhonicMind or LALAL.AI, you know the drill: Upload your MP3. Wait in a queue. Pay for "credits" or high-quality downloads. Your file sits on someone else's server. For musicians, producers, or just karaoke fans, this is slow and privacy-invasive.

Music production

Artificial intelligence

from MarTech

3 months ago

The most useful AI tool might already be in your browser

Browser-based AI assistants provide immediate screen context, eliminate copy-paste friction, and boost productivity by acting as always-on, screen-aware personal assistants.

from Computerworld

6 months ago

Enterprises should not install OpenAI's new Atlas browser, analysts warn

The browser would have more value if it included an on-device AI model that could run without requiring access to the internet, O'Donnell said. "This provides a channel through which they can get hundreds of millions of people to download their model," he said. In that scenario, the browser could access heavyweight AI models in the cloud to handle more demanding tasks.

Artificial intelligence

#gemini-25-computer-use

from ZDNET

7 months ago

Artificial intelligence

This new Google Gemini model scrolls the internet just like you do - how it works

from ZDNET

7 months ago

Artificial intelligence

Google's new Gemini 2.5 Computer Use model can click, type, and scroll

from The Verge

7 months ago

Google's latest AI model uses a web browser like you do

Google is previewing a new Gemini AI model designed to navigate and interact with the web via a browser, letting AI agents do things inside interfaces designed for use by people and not robots. The model, called Gemini 2.5 Computer Use, uses "visual understanding and reasoning capabilities" to analyze a user's request and carry out a task, such as filling out and submitting a form. It can be used for UI testing or for navigating human-oriented interfaces that lack an API or other direct connection.

Artificial intelligence

from ZDNET

8 months ago

Anthropic's Claude Chrome browser extension rolls out - how to get early access

Anthropic's first effort is a closed beta of a Chrome web browser extension. With this extension, you'll be able to chat with Claude in a persistent side panel that maintains context from active browser sessions. Beyond conversational AI, the extension can read, navigate, and take actions within websites. These actions can include tasks such as locating listings on Zillow, summarizing documents, or adding items to shopping carts -- directly from the browser sidebar.

Artificial intelligence
