Marionette icon

Marionette

by Younes Laaroussi

v1.0.0 Updated Oct 25, 2025 3.81MiB
CWS
20
Users
โ˜… 0.00
0 reviews
#21011
of 49.1K
accessibility
#739 of 1.3K

Description

๐— ๐—”๐—ฅ๐—œ๐—ข๐—ก๐—˜๐—ง๐—ง๐—˜ An AI browser automation agent that runs entirely on your device. Control websites using natural language through voice or text. Zero data leaves your machine. โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ” ๐—ช๐—›๐—”๐—ง ๐—œ๐—ง ๐——๐—ข๐—˜๐—ฆ Marionette automates web browsing tasks using Chrome's built-in Gemini Nano AI. Tell it to "fill this form", "search for X", "capture this page", or "what's this video about?" and it handles the task autonomously. Everything runs locally on your device after a one-time model download (~2GB). No API keys, no subscriptions, no servers, no tracking. โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ” ๐—ฃ๐—ฅ๐—œ๐—ฉ๐—”๐—–๐—ฌ After initial setup, the extension operates 100% offline. Your conversations, captured pages, and browsing data stay in local browser storage. Nothing is transmitted to external servers. You can verify this by opening Chrome DevTools Network tab during use - you'll see zero outbound requests. โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ” ๐—–๐—”๐—ฃ๐—”๐—•๐—œ๐—Ÿ๐—œ๐—ง๐—œ๐—˜๐—ฆ ๐˜‰๐˜ณ๐˜ฐ๐˜ธ๐˜ด๐˜ฆ๐˜ณ ๐˜ˆ๐˜ถ๐˜ต๐˜ฐ๐˜ฎ๐˜ข๐˜ต๐˜ช๐˜ฐ๐˜ฏ Click buttons, fill forms, scroll pages, navigate tabs. Uses accessibility tree to understand page structure and interact with elements. ๐˜๐˜ฐ๐˜ช๐˜ค๐˜ฆ & ๐˜›๐˜ฆ๐˜น๐˜ต ๐˜๐˜ฏ๐˜ฑ๐˜ถ๐˜ต Voice control using Web Speech API (on-device transcription) or text commands. Supports wake word detection. ๐˜”๐˜ถ๐˜ญ๐˜ต๐˜ช๐˜ฎ๐˜ฐ๐˜ฅ๐˜ข๐˜ญ ๐˜œ๐˜ฏ๐˜ฅ๐˜ฆ๐˜ณ๐˜ด๐˜ต๐˜ข๐˜ฏ๐˜ฅ๐˜ช๐˜ฏ๐˜จ Process screenshots ("what do you see?") and audio ("what's playing?"). Gemini Nano handles image and audio analysis locally. ๐˜š๐˜ฆ๐˜ฎ๐˜ข๐˜ฏ๐˜ต๐˜ช๐˜ค ๐˜๐˜ข๐˜ถ๐˜ญ๐˜ต Automatically captures pages you visit. Creates embeddings using Transformers.js for semantic search. Query your browsing history with natural language. ๐˜ˆ๐˜จ๐˜ฆ๐˜ฏ๐˜ต๐˜ช๐˜ค ๐˜“๐˜ฐ๐˜ฐ๐˜ฑ The AI observes, plans, acts, and repeats until task completion. Supports up to 60 tool iterations for complex workflows. โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ” ๐—จ๐—ฆ๐—˜ ๐—–๐—”๐—ฆ๐—˜๐—ฆ โ€ข "Fill this contact form" - Populates fields from stored information โ€ข "Capture this article for later" then "find that article about X" - Semantic search through your history โ€ข "Open YouTube and search for tutorials" - Multi-step navigation โ€ข "Summarize this page" - Content extraction and analysis โ€ข "What's this video about?" - Audio transcription and summarization โ€ข Voice control for accessibility when mouse/keyboard interaction is difficult โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ” ๐—ฆ๐—˜๐—ง๐—จ๐—ฃ 1. Install the extension 2. Enable required Chrome flags (interactive guide included) 3. Download models (~2GB one-time) 4. Grant microphone permission if using voice 5. Start using it The onboarding walks through flag setup with direct links to the correct chrome:// pages. โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ” ๐——๐—”๐—ง๐—” ๐—ฆ๐—ง๐—ข๐—ฅ๐—”๐—š๐—˜ All data stored locally: โ€ข Conversation history (Chrome Extension storage) โ€ข Captured pages and embeddings (IndexedDB) โ€ข Optional user info for form autofill (Chrome Extension storage) โ€ข Screenshots and audio (temporary, local only) Nothing transmitted to external servers. Check DevTools Network tab to verify. โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ” ๐—ง๐—˜๐—–๐—›๐—ก๐—œ๐—–๐—”๐—Ÿ ๐——๐—˜๐—ง๐—”๐—œ๐—Ÿ๐—ฆ ๐˜ˆ๐˜ ๐˜š๐˜ต๐˜ข๐˜ค๐˜ฌ: โ€ข Gemini Nano (Chrome Prompt API) โ€ข Transformers.js (all-MiniLM-L6-v2 embeddings) โ€ข Web Speech API (voice transcription) โ€ข Readability.js (content extraction) ๐˜ˆ๐˜ณ๐˜ค๐˜ฉ๐˜ช๐˜ต๐˜ฆ๐˜ค๐˜ต๐˜ถ๐˜ณ๐˜ฆ: โ€ข 22 specialized automation tools โ€ข Agentic loopback (observe โ†’ act โ†’ repeat) โ€ข Chunk-based RAG for semantic memory โ€ข TypeScript + React + Plasmo framework Open source under MIT License. โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ” ๐—ฅ๐—˜๐—ค๐—จ๐—œ๐—ฅ๐—˜๐— ๐—˜๐—ก๐—ง๐—ฆ โ€ข Chrome browser (latest version) โ€ข ~2GB disk space (one-time model download) โ€ข Chrome flags enabled (setup guide included) โ€ข Microphone for voice control (optional) โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ” ๐—Ÿ๐—œ๐— ๐—œ๐—ง๐—”๐—ง๐—œ๐—ข๐—ก๐—ฆ This is an on-device AI system, which comes with trade-offs: โ€ข Gemini Nano is smaller than cloud models (3B parameters vs GPT-4's hundreds of billions) โ€ข Inference takes 1-3 seconds per response instead of sub-second for cloud APIs โ€ข Context window is 9,216 tokens (auto-summarization helps manage this) โ€ข Models take ~2GB disk space The benefit: complete privacy. Your data never leaves your device. โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ” ๐—ฆ๐—จ๐—ฃ๐—ฃ๐—ข๐—ฅ๐—ง Email: hello@youneslaaroussi.ca GitHub: Issues and pull requests welcome License: MIT (free and open source)
Marionette screenshot 1Marionette screenshot 2Marionette screenshot 3Marionette screenshot 4

Reviews

Loading reviews...

Permissions (9)

Permissions

activeTabโ„น Can access the current tab when you click the extension audioCapture scriptingโ„น Can inject scripts into web pages sidePanel storageโ„น Can store data locally in your browser tabCaptureโ„น Can capture tab audio/video tabsโ„น Can see your open tabs and their URLs unlimitedStorage webNavigationโ„น Can monitor navigation events

Details

Version 1.0.0
Updated Oct 25, 2025
Size 3.81MiB
First Seen Mar 22, 2026