openclaw

mirror of https://github.com/openclaw/openclaw.git synced 2026-05-02 14:40:27 +02:00

Author	SHA1	Message	Date
Vincent Koc	fbd6b3ce3c	docs(tts): A-Z order providers and add tools/tts to Tools nav group - docs/tools/tts.md: alphabetize providers in three places that listed them: the supported-providers table (Azure Speech ... Xiaomi MiMo), the configuration Tabs (12 provider presets in A-Z), and the field reference AccordionGroup. Top-level fields stay first; provider tabs/accordions follow strict alphabetical order. Wording, schema, and defaults unchanged. - docs/docs.json: add tools/tts to the main Tools sidebar group (slotted between trajectory and video-generation, matching the alphabetical neighborhood with image-generation, music-generation, video-generation). Previously tts only appeared under Nodes > Media capabilities, which was a discoverability gap for readers looking for TTS alongside the other generation tools.	2026-04-25 22:05:46 -07:00
Vincent Koc	71b79f49ad	docs(tts): rewrite tts.md around personas with Mintlify components The TTS doc had grown to 1008 lines with 11 separate flat 'X primary' config blocks, a 100-line dense 'Notes on fields' bullet list, and the new provider-personas feature (#70748) buried near the bottom. Restructure for readability and feature visibility: - Lead with a Steps-based 'Quick start' so first-time readers can enable TTS in 4 explicit steps. - Replace the 13-bullet provider list with a single 'Supported providers' table that names auth env vars and per-provider notes inline. Add a Warning callout for the Microsoft/edge legacy alias. - Collapse the 11 'X primary' config blocks into one Tabs component ('OpenAI + ElevenLabs', 'Google Gemini', 'Azure Speech', 'Microsoft (no key)', 'MiniMax', 'Inworld', 'xAI', 'Volcengine', 'Xiaomi MiMo', 'OpenRouter', 'Gradium', 'Local CLI') so users see one preset at a time and the page is scannable. - Promote 'Personas' to its own top-level section with two examples (minimal and the Alfred provider-neutral persona), and add a new 'How providers use persona prompts' AccordionGroup covering Google (promptTemplate audio-profile-v1, personaPrompt), OpenAI (instructions auto-mapping), and Other providers, plus a fallback policy table. - Note that agents.list[].tts.persona overrides global persona per-agent (covers the recent feat(tts) per-agent voice-override work). - Convert the 100-line 'Notes on fields' wall into a per-provider AccordionGroup using ParamField, so the field reference is scannable and field types/defaults are visually distinct. - Sentence-case headings, drop redundant body H1, fold the flow diagram inline with Auto-TTS behavior, and refresh the Output formats section to a table-first layout. - Schema fields (label/description/provider/fallbackPolicy/prompt with profile/scene/sampleContext/style/accent/pacing/constraints and providers map) verified against src/config/types.tts.ts; all defaults and env-var fallbacks preserved verbatim. Net diff: 585 insertions, 684 deletions across the same surface area.	2026-04-25 22:00:19 -07:00
Peter Steinberger	6a67f65568	fix(voice): reuse preflight transcripts across channels	2026-04-26 05:42:04 +01:00
Barron Roth	0594fa3c4d	TTS: add provider personas	2026-04-26 09:42:38 +05:30
Peter Steinberger	9ed11d6c49	fix: steer agents to safe gateway config flow	2026-04-26 05:00:17 +01:00
Peter Steinberger	540c70d166	fix(plugins): ignore bundled load path aliases	2026-04-26 04:46:05 +01:00
Peter Steinberger	4edf22f63f	fix(acpx): avoid startup agent probes by default	2026-04-26 04:40:26 +01:00
Peter Steinberger	ed1ac2fc44	feat(browser): add CDP role snapshot fallback	2026-04-26 04:40:26 +01:00
Peter Steinberger	6d4f65c9d4	docs: clarify codex runtime routing	2026-04-26 04:38:39 +01:00
Peter Steinberger	2c8c79de5c	fix(tts): normalize streamed tts voice media	2026-04-26 04:28:19 +01:00
Peter Steinberger	a91baa16de	fix(tts): honor explicit directive providers	2026-04-26 04:14:48 +01:00
Peter Steinberger	cf834e2a21	fix(tts): clean streamed directive text	2026-04-26 04:09:56 +01:00
Peter Steinberger	7a85c1a822	fix(tts): surface voice status and harden providers	2026-04-26 03:51:30 +01:00
Peter Steinberger	97ae1c7c2e	feat(tts): add read-latest voice command	2026-04-26 03:44:44 +01:00
Peter Steinberger	3989510251	docs: expand ACP agents guide	2026-04-26 03:42:44 +01:00
Peter Steinberger	f0fa35082b	fix: keep ACP completion prompts harness-safe	2026-04-26 03:39:24 +01:00
Peter Steinberger	6855b33255	docs(tts): clarify WhatsApp voice-note delivery	2026-04-26 03:28:51 +01:00
Peter Steinberger	9b91040053	fix(tts): route WhatsApp MP3 TTS as voice notes	2026-04-26 03:26:00 +01:00
Peter Steinberger	9b4f0779ce	fix(tts): honor per-agent config in tts commands	2026-04-26 03:12:30 +01:00
Peter Steinberger	a6d9926d1d	fix: keep acp management commands local	2026-04-26 03:02:04 +01:00
Peter Steinberger	0ca952cdd5	feat(tts): add per-agent voice overrides	2026-04-26 02:54:13 +01:00
Shivanker Goel	a932a58e87	feat(fal): support Seedance reference video Adds fal Seedance 2.0 reference-to-video support with model-aware reference input limits.	2026-04-26 02:30:23 +01:00
Peter Steinberger	5b80d0c15e	feat(tts): add Azure Speech provider Co-authored-by: Leon Chui <84605354+leonchui@users.noreply.github.com>	2026-04-26 01:42:51 +01:00
Peter Steinberger	81c2a1de26	test: add Droid ACP bind Docker lane	2026-04-26 01:31:27 +01:00
Peter Steinberger	e6ee4d6e68	fix(browser): preserve tabs across target swaps	2026-04-26 01:21:59 +01:00
Vincent Koc	f3accc753c	feat(plugins): add before agent finalize hook (#71765 )	2026-04-25 17:21:17 -07:00
Peter Steinberger	3a4325b285	fix: prevent duplicate channel plugin tools	2026-04-26 01:06:11 +01:00
Shakker	babbad81a9	fix: preserve plugin install records without manifests	2026-04-26 01:03:13 +01:00
Shakker	37ce39b5c5	docs: describe plugin install index store	2026-04-26 01:03:12 +01:00
Peter Steinberger	8e12c24d17	fix: prefer native codex app-server controls	2026-04-26 00:59:02 +01:00
Peter Steinberger	12c16576cd	fix: gate acp spawn affordances	2026-04-26 00:30:27 +01:00
Peter Steinberger	41b27024bb	docs(gateway): clarify backend RPC pairing	2026-04-26 00:26:35 +01:00
Rui Xu	1531123d35	feat(tts): add BytePlus Seed Speech provider Add Volcengine/BytePlus Seed Speech as a bundled TTS provider with current API-key auth, legacy AppID/token fallback, native Ogg/Opus voice-note output, and MP3 audio-file output. Co-authored-by: Peter Steinberger <steipete@gmail.com>	2026-04-25 23:46:04 +01:00
Peter Steinberger	b1b29a8fc2	fix: stabilize remote skill node probes	2026-04-25 23:42:02 +01:00
Peter Steinberger	b721f1dbad	fix: update Ollama web search endpoint	2026-04-25 22:34:43 +01:00
Cale Shapera	0bcb4c95c1	feat(tts): add Inworld speech provider (#55972 ) Adds the bundled Inworld speech provider with docs, config surface, SSRF-guarded fetches, directive overrides, native voice-note/telephony output coverage, and live `.profile` verification. Co-authored-by: cshape <cshape@users.noreply.github.com>	2026-04-25 22:33:21 +01:00
Peter Steinberger	2febe72108	fix: isolate ACP spawned runs	2026-04-25 22:06:53 +01:00
Peter Steinberger	9e9e024188	docs: clarify ACP model override support	2026-04-25 21:52:36 +01:00
Peter Steinberger	e2fd3dcee9	fix(google): emit opus voice-note tts	2026-04-25 21:33:33 +01:00
Tars	d5b6667823	fix(minimax): enable portal music and video generation	2026-04-25 21:30:10 +01:00
Peter Steinberger	6a7b76e119	fix(acp): guard sessions_spawn runtime targets	2026-04-25 21:23:24 +01:00
Vincent Koc	793b58b3f1	fix(plugins): add doctor registry repair	2026-04-25 12:45:43 -07:00
Peter Steinberger	75d64cd4b8	feat: expose generic image background option	2026-04-25 20:21:46 +01:00
Quratulain-bilal	7d58362f3f	docs(browser): note tilde expansion also covers per-profile paths (#71601 ) * docs(browser): note tilde expansion also covers per-profile paths The `95a2c9b` fix expanded "~" for both `browser.executablePath` and per-profile `profiles.<name>.executablePath` (config.ts:382 calls `normalizeExecutablePath` for profile overrides). Per-profile `userDataDir` on existing-session profiles is also tilde-expanded (config.ts:391 via `resolveUserPath`). The configuration reference only mentioned the top-level `browser.executablePath` case. * docs(browser): align tilde path config help --------- Co-authored-by: Peter Steinberger <steipete@gmail.com>	2026-04-25 20:05:03 +01:00
Quratulain-bilal	8170df9127	docs(browser): document local startup timeout bounds (#71672 ) * docs(browser): document local startup timeout bounds The new browser.localLaunchTimeoutMs and browser.localCdpReadyTimeoutMs options are clamped to MAX_BROWSER_STARTUP_TIMEOUT_MS (120000 ms) by normalizeStartupTimeoutMs in extensions/browser/src/browser/config.ts, and zero/negative/non-finite values fall back to the defaults. Without this in the configuration reference, users setting a higher value see no error and silently get the 120 s ceiling, or set 0 expecting 'no timeout' and silently get the default. * docs(browser): clarify startup timeout validation --------- Co-authored-by: Peter Steinberger <steipete@gmail.com>	2026-04-25 19:59:53 +01:00
Peter Steinberger	b66f01bdca	fix: expose transparent image infer options	2026-04-25 19:58:41 +01:00
91wan	bb2b68b34e	fix(acp): pass Codex ACP model thinking overrides Fix ACP Codex model/thinking override propagation.\n\nThanks @91wan.	2026-04-25 19:56:03 +01:00
Peter Steinberger	de0097a23c	fix: support transparent OpenAI image generation	2026-04-25 19:28:56 +01:00
Chris Zhang	c3bfd328ad	feat(litellm): add image generation provider (#70246 ) * feat(litellm): add image generation provider Registers litellm as an image-generation provider so model refs like litellm/gpt-image-2 route through the LiteLLM proxy, and agents.defaults.imageGenerationModel.fallbacks entries of the form litellm/... resolve without "No image-generation provider registered for litellm" errors. Implementation uses the OpenAI-compatible /images/generations and /images/edits endpoints that LiteLLM proxies for. BaseUrl resolves from models.providers.litellm.baseUrl (default http://localhost:4000). Private network is auto-allowed when baseUrl is a loopback/RFC1918 address, which covers the common self-hosted LiteLLM proxy case without needing OPENCLAW_PROVIDER_ALLOW_PRIVATE_NETWORK. Public baseUrls keep normal SSRF defaults. Default model is gpt-image-2 (matching upstream 4.21+ OpenAI default). Advertises the same 2K/4K sizes OpenAI now exposes, plus legacy 256/512/1024 for dall-e-3. Supports both generate and edit. Local patch. LiteLLM has no upstream image-generation support yet; revisit if upstream adds one. * ci: rerun after upstream main hot-fix * fix(litellm): harden image generation provider --------- Co-authored-by: Chris Zhang <chris@ChrisdeMac-mini.local> Co-authored-by: Peter Steinberger <steipete@gmail.com>	2026-04-25 19:06:51 +01:00
Peter Steinberger	9ffe764416	fix(whatsapp): send voice note text separately	2026-04-25 18:55:03 +01:00

1 2 3 4 5 ...

1008 Commits