Gemma Gem is a Chrome extension that loads Google's Gemma 4 (2B) through WebGPU in an offscreen document and gives it tools to interact with any webpage: read content, take screenshots, click elements, type text, scroll, and run JavaScript.
You get a small chat overlay on every page.
Ask it about the page and it (usually) figures out which tools to call.
It has a thinking mode that shows chain-of-thought reasoning as it works.
It's a 2B model in a browser.
It works for simple page questions and running JavaScript, but multi-step tool chains are unreliable and it sometimes ignores its tools entirely.
The agent loop has zero external dependencies and can be extracted as a standalone library if anyone wants to experiment with it.
Related Stories
Source: This article was originally published by Hacker News
Read Full Original Article →
Comments (0)
No comments yet. Be the first to comment!
Leave a Comment