feat: Add TOON (Token-Oriented Object Notation) support for efficient data serialization #82

meirk-brd · 2025-11-04T08:11:33Z

Summary

Optimizes token consumption for structured and markdown data:

TOON Format Integration: Adds Token-Oriented Object Notation for web_data_* tools, reducing tokens by 10-30% vs JSON while maintaining readability
Markdown Minification: Implements remark + strip-markdown plugin for scrape_as_markdown tool, achieving 60% token reduction by removing base64 images, HTML tags, and formatting while preserving meaningful content
Google Search Response Sanitization: Cleanses search results by normalizing whitespace, removing Unicode formatting characters, deduplicating related keywords, and extracting only essential fields (link, title, description) from organic results (Thanks to Nikita for this valuable feedback | example response)

Integrated TOON serialization library for compact data representation
Added remark/strip-markdown processing pipeline for markdown responses
Implemented clean_google_search_payload() to sanitize and normalize search engine results

(Still being tested and evaluated for efficiency)

meirk-brd added 3 commits November 3, 2025 18:57

Add token optimization logic

f752e7c

add remark+strip for markdown minification

f95fbe1

sanitize google responses for token reduction

08c6c10