Re - feat(pipecat-sdk): add speech-to-speech model support (Gemini Live) #683
Conversation
Deploying with

| Status | Name | Latest Commit | Preview URL | Updated (UTC) |
|---|---|---|---|---|
| ⛔ Deployment terminated (View logs) | supermemory-app | 0a8c5fa | Commit Preview URL / Branch Preview URL | Jan 21 2026, 04:19 AM |
How to use the Graphite Merge Queue: Add the label Main to this PR to add it to the merge queue. You must have a Graphite account in order to use the merge queue. Sign up using this link. An organization admin has enabled the Graphite Merge Queue in this repository. Please do not merge from GitHub, as this will restart CI on PRs being processed by the merge queue. This stack of pull requests is managed by Graphite. Learn more about stacking.
Code review: No issues found. Checked for bugs and CLAUDE.md compliance.
Merge activity
feat(pipecat-sdk): add speech-to-speech model support (Gemini Live) (#683)

#### RE-RAISING Pipecat live speech PR

### Added native speech-to-speech model support

### Summary:
- Speech-to-speech support
- Auto-detect audio frames and inject memories into the system prompt for native audio models (Gemini Live, etc.)
- Fix memory bloating - replace memories each turn using XML tags instead of accumulating them
- Add temporal context - show recency on search results ([2d ago], [15 Jan])
- New inject_mode param - auto (default), system, or user

### Docs update
- Update the docs for native speech-to-speech models
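To make the memory-bloating fix and the temporal-context items above concrete, here is a minimal sketch of the general idea. It is not the SDK's actual implementation: the `<supermemory>` tag name and the `format_recency` / `inject_memories` helpers are assumptions introduced purely for illustration.

```python
import re
from datetime import datetime, timezone

# Hypothetical tag name; the SDK's actual XML tag may differ.
SUPERMEMORY_BLOCK = re.compile(r"<supermemory>.*?</supermemory>", re.DOTALL)


def format_recency(created_at: datetime, now: datetime | None = None) -> str:
    """Return a short recency label such as '[2d ago]' or '[15 Jan]'."""
    now = now or datetime.now(timezone.utc)
    age_days = (now - created_at).days
    if age_days < 7:
        return f"[{age_days}d ago]"
    return f"[{created_at.strftime('%d %b')}]"


def inject_memories(system_prompt: str, memories: list[str]) -> str:
    """Replace the tagged memory block each turn instead of appending to it."""
    block = "<supermemory>\n" + "\n".join(memories) + "\n</supermemory>"
    if SUPERMEMORY_BLOCK.search(system_prompt):
        # Overwrite the previous turn's memories so the prompt does not grow unbounded.
        return SUPERMEMORY_BLOCK.sub(lambda _: block, system_prompt)
    return f"{system_prompt}\n\n{block}"
```

Replacing a delimited block keeps the system prompt the same size across turns, which is the essence of the bloating fix described above; the recency labels are simply prefixed to each search result before injection.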
245ea24 to 0a8c5fa (Compare)
| """Utility functions for Supermemory Pipecat integration.""" | ||
|
|
||
| from typing import Dict, List | ||
| from datetime import datetime, timezone | ||
| from typing import Any, Dict, List, Union | ||
|
|
||
|
|
||
| def get_last_user_message(messages: List[Dict[str, str]]) -> str | None: |
Bug: The get_last_user_message function doesn't handle multimodal message content (lists), which will cause memory retrieval to fail for those messages.
Severity: MEDIUM
Suggested Fix
Update get_last_user_message to handle cases where message['content'] is a list. Check if the content is a list and, if so, iterate through its parts to extract and join the text content into a single string, similar to the implementation in supermemory_openai/utils.py.
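A minimal sketch of what the suggested fix could look like, assuming the OpenAI-style content-parts shape (`{"type": "text", "text": ...}`) that `supermemory_openai/utils.py` is said to handle; the exact part structure of Pipecat messages is an assumption here.

```python
from typing import Any, Dict, List, Optional


def get_last_user_message(messages: List[Dict[str, Any]]) -> Optional[str]:
    """Return the text of the most recent user message, or None if there is none."""
    for message in reversed(messages):
        if message.get("role") != "user":
            continue
        content = message.get("content")
        if isinstance(content, str):
            return content
        if isinstance(content, list):
            # Multimodal content: join the text parts into a single query string.
            texts = [
                part.get("text", "")
                for part in content
                if isinstance(part, dict) and part.get("type") == "text"
            ]
            joined = " ".join(t for t in texts if t)
            return joined or None
        return None
    return None
```

Returning a plain string keeps the downstream `_retrieve_memories` call working for both text-only and multimodal turns.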
Prompt for AI Agent
Review the code at the location below. A potential bug has been identified by an AI agent. Verify if this is a real issue. If it is, propose a fix; if not, explain why it's not valid.
Location: packages/pipecat-sdk-python/src/supermemory_pipecat/utils.py#L1-L7
Potential issue: The function `get_last_user_message` in `supermemory_pipecat/utils.py` assumes that the `content` of a message is always a string. However, Pipecat messages can also contain a list for multimodal content, a scenario more likely in the speech-to-speech pipelines this pull request supports. When a message with list content is processed, the function returns the list instead of a string. That list is then passed to `_retrieve_memories`, which expects a string query, causing the memory retrieval API call to fail. The failure is caught and logged, but it silently breaks the memory feature for any multimodal user input.