Gemma Gem – AI model embedded in a browser – no API keys, no cloud

Gemma Gem is a Chrome extension that loads Google's Gemma 4 (2B) through WebGPU in an offscreen document and gives it tools to interact with any webpage: read content, take screenshots, click elements, type text, scroll, and run JavaScript.You get a small chat overlay on every page.

Gemma Gem – AI model embedded in a browser – no API keys, no cloud
Gemma Gem – AI model embedded in a browser – no API keys, no cloud Photo: Hacker News

Gemma Gem is a Chrome extension that loads Google's Gemma 4 (2B) through WebGPU in an offscreen document and gives it tools to interact with any webpage: read content, take screenshots, click elements, type text, scroll, and run JavaScript.

You get a small chat overlay on every page.

Ask it about the page and it (usually) figures out which tools to call.

It has a thinking mode that shows chain-of-thought reasoning as it works.

It's a 2B model in a browser.

It works for simple page questions and running JavaScript, but multi-step tool chains are unreliable and it sometimes ignores its tools entirely.

The agent loop has zero external dependencies and can be extracted as a standalone library if anyone wants to experiment with it.

Source: This article was originally published by Hacker News

Read Full Original Article →

Share this article

Comments (0)

No comments yet. Be the first to comment!

Leave a Comment

Maximum 2000 characters