-
Notifications
You must be signed in to change notification settings - Fork 546
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[draft] feat: multi-task agents #1270
Open
longcw
wants to merge
29
commits into
main
Choose a base branch
from
longc/multi-stage-agent
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
+1,804
−79
Open
Changes from 13 commits
Commits
Show all changes
29 commits
Select commit
Hold shift + click to select a range
f65c990
feat: multi stage agent testing
longcw 9e1faa2
update multi stage example with better transfer
longcw 50f9123
fix: add filler messages and fix transfer_to_spec
longcw 98144ec
log the chat ctx to file
longcw 9b0b9a3
Merge branch 'main' into longc/multi-stage-agent
longcw a4712fd
add agent task
longcw fac0f53
add a new example for agent task
longcw 5d49100
add checkout task
longcw da9da7b
improve multi task example
longcw 3e7cede
clean example
longcw 726ed32
refactor the AgentTask
longcw fcea5a7
Merge branch 'main' into longc/multi-stage-agent
longcw d16c847
add news mailer example
longcw 92ab58d
update restaurant instrunctions
longcw a94f06d
rename to transfer_function_description
longcw cb2a378
Merge remote-tracking branch 'origin/main' into longc/multi-stage-agent
longcw 4998b57
clean test files
longcw c89956d
update agent task API
longcw 3fb2da4
Merge branch 'main' into longc/multi-stage-agent
longcw c4fa665
Merge remote-tracking branch 'origin/main' into longc/multi-stage-agent
longcw e186426
add inline fnc
longcw 351a6a8
fix chat ctx injection
longcw 2cfac17
imporve the inline example
longcw 5581541
append tranfer functions to original task
longcw 33bb51c
Merge remote-tracking branch 'origin/main' into longc/multi-stage-agent
longcw 0524bb4
fix lint and types
longcw d522c03
add multi task agent for realtime api
longcw 4c67472
insert the user data as system messages
longcw ecf6606
fix chat ctx truncate
longcw File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
204 changes: 204 additions & 0 deletions
204
examples/voice-pipeline-agent/multi_task/news_mailer.py
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,204 @@ | ||
import asyncio | ||
import json | ||
import logging | ||
from typing import Annotated, TypedDict | ||
|
||
from dotenv import load_dotenv | ||
from livekit import rtc | ||
from livekit.agents import ( | ||
AutoSubscribe, | ||
JobContext, | ||
JobProcess, | ||
WorkerOptions, | ||
cli, | ||
llm, | ||
) | ||
from livekit.agents.pipeline import AgentCallContext, VoicePipelineAgent | ||
from livekit.agents.pipeline.agent_task import AgentTask | ||
from livekit.agents.stt import SpeechData, SpeechEvent, SpeechEventType | ||
from livekit.plugins import deepgram, openai, silero | ||
|
||
load_dotenv() | ||
|
||
logger = logging.getLogger("news-mailer") | ||
logger.setLevel(logging.INFO) | ||
|
||
|
||
class UserData(TypedDict): | ||
query: str | None | ||
news: str | None | ||
email: str | None | ||
|
||
|
||
@llm.ai_callable() | ||
async def query_news( | ||
query: Annotated[str, llm.TypeInfo(description="The query user asked for")], | ||
) -> str: | ||
"""Called to query news from the internet. | ||
Tell the user you are checking the news when calling this function.""" | ||
logger.info(f"Querying news for {query}") | ||
perplexity_llm = openai.LLM.with_perplexity( | ||
model="llama-3.1-sonar-small-128k-online" | ||
) | ||
chat_ctx = llm.ChatContext().append( | ||
role="system", | ||
text="Search the recent news articles about the query.", | ||
) | ||
chat_ctx.append(role="user", text=query) | ||
llm_stream = perplexity_llm.chat(chat_ctx=chat_ctx) | ||
news = "" | ||
async for chunk in llm_stream: | ||
if not chunk or not chunk.choices or not chunk.choices[0].delta.content: | ||
continue | ||
news += chunk.choices[0].delta.content | ||
|
||
agent = AgentCallContext.get_current().agent | ||
user_data: UserData = agent.user_data | ||
user_data["query"] = query | ||
user_data["news"] = news | ||
logger.info(f"The news about {query} collected") | ||
return news | ||
|
||
|
||
@llm.ai_callable() | ||
async def send_news_email() -> str: | ||
"""Called to send the news to the user's email address.""" | ||
agent = AgentCallContext.get_current().agent | ||
user_data: UserData = agent.user_data | ||
email = user_data.get("email") | ||
news = user_data.get("news") | ||
|
||
if not email: | ||
return "email is not collected" | ||
|
||
if not news: | ||
return "news is not collected" | ||
|
||
# mock sending email | ||
query = user_data.get("query") | ||
logger.info(f"Sending news about {query} to {email}") | ||
await asyncio.sleep(2) | ||
return f"The news about {query} is sent to {email}" | ||
|
||
|
||
@llm.ai_callable() | ||
async def verify_email( | ||
email: Annotated[str, llm.TypeInfo(description="The collected email address")], | ||
) -> str: | ||
"""Called to verify the user's email address.""" | ||
if "@" not in email: | ||
return "The email address is not valid, please confirm with the user." | ||
|
||
# Potentially show the email on the screen for the user to confirm | ||
return "The email address is valid. Please confirm with the user for the spelling." | ||
|
||
|
||
@llm.ai_callable() | ||
async def update_email( | ||
email: Annotated[str, llm.TypeInfo(description="The collected email address")], | ||
) -> str: | ||
"""Called to update the user's email address.""" | ||
|
||
agent = AgentCallContext.get_current().agent | ||
user_data: UserData = agent.user_data | ||
user_data["email"] = email | ||
logger.info(f"The email is updated to {email}") | ||
return f"The email is updated to {email}." | ||
|
||
|
||
news_mailer = AgentTask( | ||
name="news_mailer", | ||
instructions=( | ||
"You are a friendly assistant that can query news from the internet." | ||
"Summarize the news in 50 words or less and ask the user if they want to receive the news by email." | ||
"Use email_collector to collect the user's email address." | ||
), | ||
functions=[query_news, send_news_email], | ||
) | ||
|
||
email_collector = AgentTask( | ||
name="email_collector", | ||
instructions=( | ||
"You are a friendly assistant that can collect the user's email address. Your tasks:\n" | ||
"1. Collect the user's email address, help to complete the @ and domain part if possible.\n" | ||
"2. Verify the address with `verify_email` function until the user confirms.\n" | ||
"3. Update the email address after the user confirms.\n" | ||
"Transfer back to news_mailer after the email is updated." | ||
), | ||
functions=[update_email, verify_email], | ||
) | ||
|
||
|
||
async def entrypoint(ctx: JobContext): | ||
await ctx.connect(auto_subscribe=AutoSubscribe.AUDIO_ONLY) | ||
|
||
chat_log_file = "news_mailer.log" | ||
|
||
# Set up chat logger | ||
chat_logger = logging.getLogger("chat_logger") | ||
chat_logger.setLevel(logging.INFO) | ||
handler = logging.FileHandler(chat_log_file) | ||
formatter = logging.Formatter("%(message)s") | ||
handler.setFormatter(formatter) | ||
chat_logger.addHandler(handler) | ||
|
||
participant = await ctx.wait_for_participant() | ||
agent = VoicePipelineAgent( | ||
vad=ctx.proc.userdata["vad"], | ||
stt=deepgram.STT(), | ||
llm=openai.LLM(), | ||
tts=openai.TTS(), | ||
initial_task=news_mailer, | ||
available_tasks=[news_mailer, email_collector], | ||
max_nested_fnc_calls=3, # may call functions in the transition function | ||
) | ||
|
||
# read text input from the room for easy testing | ||
@ctx.room.on("data_received") | ||
def on_data_received(packet: rtc.DataPacket): | ||
if packet.topic == "lk-chat-topic": | ||
data = json.loads(packet.data.decode("utf-8")) | ||
logger.debug("Text input received", extra={"message": data["message"]}) | ||
|
||
agent._human_input.emit( | ||
"final_transcript", | ||
SpeechEvent( | ||
type=SpeechEventType.END_OF_SPEECH, | ||
alternatives=[SpeechData(language="en", text=data["message"])], | ||
), | ||
) | ||
|
||
# write the chat log to a file | ||
@agent.on("user_speech_committed") | ||
@agent.on("agent_speech_interrupted") | ||
@agent.on("agent_speech_committed") | ||
def on_speech_committed(message: llm.ChatMessage): | ||
chat_logger.info(f"{message.role}: {message.content}") | ||
|
||
@agent.on("function_calls_collected") | ||
def on_function_calls_collected(calls: list[llm.FunctionCallInfo]): | ||
fnc_infos = [{fnc.function_info.name: fnc.arguments} for fnc in calls] | ||
chat_logger.info(f"fnc_calls_collected: {fnc_infos}") | ||
|
||
@agent.on("function_calls_finished") | ||
def on_function_calls_finished(calls: list[llm.CalledFunction]): | ||
called_infos = [{fnc.call_info.function_info.name: fnc.result} for fnc in calls] | ||
chat_logger.info(f"fnc_calls_finished: {called_infos}") | ||
|
||
# Start the assistant. This will automatically publish a microphone track and listen to the participant. | ||
agent.start(ctx.room, participant) | ||
await agent.say("Welcome to news mailer! How may I assist you today?") | ||
|
||
|
||
def prewarm_process(proc: JobProcess): | ||
# preload silero VAD in memory to speed up session start | ||
proc.userdata["vad"] = silero.VAD.load() | ||
|
||
|
||
if __name__ == "__main__": | ||
cli.run_app( | ||
WorkerOptions( | ||
entrypoint_fnc=entrypoint, | ||
prewarm_fnc=prewarm_process, | ||
), | ||
) |
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
how does it know which tasks to use in this case? is it purely from the instructions indicating
Use email_collector to collect the user's email address.
.. does that meanemail_collector
is represented as a function?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
each task has a built in
transfer_to
method that will be added tofnc_ctx
if thecan_enter
of the task returns true.