
@K4TV K4TV commented Aug 21, 2025

Description
Replace eval() with ast.literal_eval() to prevent code injection. This PR addresses the code injection vulnerability (CWE-94) identified in the function_message function, where eval() was being used to parse function arguments. Using eval() here allows arbitrary code execution if an attacker can control the input message.

Changes Made:

  • Replaced all instances of eval() with ast.literal_eval() in interface.py
  • Added import ast where needed
  • Maintained existing functionality while improving security
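The substitution can be sketched as follows. Note that parse_function_args is a hypothetical helper for illustration, not the actual code in interface.py:

```python
import ast

def parse_function_args(function_args: str):
    # Before: eval(function_args) executed arbitrary expressions.
    # After: ast.literal_eval() only accepts Python literals
    # (strings, numbers, tuples, lists, dicts, sets, booleans, None)
    # and raises ValueError for anything else, such as function calls.
    return ast.literal_eval(function_args)

print(parse_function_args("{'key': 'value'}"))  # parses the dict safely
try:
    parse_function_args("__import__('os').system('echo pwned')")
except ValueError:
    print("blocked: not a literal")
```

Because ast.literal_eval() never executes code, the worst a malicious argument string can do is raise an exception.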

How to test

  • Create a test file with the following content:
import ast
 
def archival_memory_insert(content):
    print("Code inserted.")

def function_message(msg):
    if msg.startswith("Running "):
        function_args = msg[len("Running "):].strip()
        print(f"Detected function call: {function_args}")
        msg_dict = eval(function_args)  # this is where eval() is used

def function_message_literal(msg):
    if msg.startswith("Running "):
        function_args = msg[len("Running "):].strip()
        print(f"Detected function call - literal: {function_args}")
        msg_dict = ast.literal_eval(function_args)  # this is where ast.literal_eval() is used

if __name__ == "__main__":
    # test inputs
    test_input = "Running archival_memory_insert({'key': 'value'})"
    exploit_input = "Running archival_memory_insert(__import__('os').system('echo vulnerable'))"
    exploit_input2 = "Running archival_memory_insert(__import__('os').system('echo *'))"

    print("--------------- Testing eval() ---------------")
    print("eval() - DANGEROUS (executes code):")
    function_message(test_input)
    print("eval() - DANGEROUS (executes code):")
    function_message(exploit_input)
    print("eval() - DANGEROUS (executes code): \n")
    function_message(exploit_input2)

    print("\n----------Testing malicious input----------")
    print("literal_eval() - SAFE (blocks code):")
    try:
        function_message_literal(exploit_input)
    except ValueError as e:
        print(f"ValueError: {e}")
    
    try:
        function_message_literal(exploit_input2)
    except ValueError as e:
        print(f"ValueError: {e}")

Run the test:

python3 [test_file_name]
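One caveat the test file surfaces implicitly: a whole call expression like archival_memory_insert({'key': 'value'}) is not itself a literal, so ast.literal_eval() rejects the benign input too. The fix therefore has to apply ast.literal_eval() only to the argument portion of the message. A minimal sketch, assuming the arguments can be split off with a regex; function_message_safe and the message format are illustrative, not the actual interface.py code:

```python
import ast
import re

def function_message_safe(msg: str):
    # Extract only the literal argument from a "Running func(...)" message,
    # then parse that argument with ast.literal_eval().
    match = re.match(r"Running\s+\w+\((.*)\)\s*$", msg)
    if not match:
        raise ValueError(f"unrecognized function message: {msg!r}")
    return ast.literal_eval(match.group(1))

# Benign input now parses:
print(function_message_safe("Running archival_memory_insert({'key': 'value'})"))
# Malicious input still raises ValueError:
try:
    function_message_safe(
        "Running archival_memory_insert(__import__('os').system('echo vulnerable'))"
    )
except ValueError:
    print("blocked")
```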

Have you tested this PR?
Yes. These were the results:

# test cases
--------------- Testing eval() ---------------
eval() - DANGEROUS (executes code):
Detected function call: archival_memory_insert({'key': 'value'})
Code inserted.
eval() - DANGEROUS (executes code):
Detected function call: archival_memory_insert(__import__('os').system('echo vulnerable'))
vulnerable
Code inserted.
eval() - DANGEROUS (executes code): 

Detected function call: archival_memory_insert(__import__('os').system('echo *'))
__init__.py __pycache__ clear_postgres_db.py code_inject.py config.py configs conftest.py constants.py data helpers integration_test_agent_tool_graph.py integration_test_async_tool_sandbox.py integration_test_batch_api_cron_jobs.py integration_test_batch_sdk.py integration_test_builtin_tools.py integration_test_chat_completions.py integration_test_composio.py integration_test_multi_agent.py integration_test_pinecone_tool.py integration_test_send_message.py integration_test_sleeptime_agent.py integration_test_summarizer.py integration_test_tool_execution_sandbox.py integration_test_voice_agent.py manual_test_many_messages.py manual_test_multi_agent_broadcast_large.py mcp pytest.ini sdk test_agent_files test_agent_serialization.py test_agent_serialization_v2.py test_base_functions.py test_cli.py test_client.py test_file_processor.py test_google_embeddings.py test_letta_agent_batch.py test_letta_request_schema.py test_llm_clients.py test_managers.py test_memory.py test_multi_agent.py test_optimistic_json_parser.py test_plugins.py test_provider_trace.py test_providers.py test_redis_client.py test_sdk_client.py test_server.py test_sources.py test_static_buffer_summarize.py test_stream_buffer_readers.py test_timezone_formatting.py test_tool_rule_solver.py test_tool_sandbox test_tool_schema_parsing.py test_tool_schema_parsing_files test_utils.py utils.py
Code inserted.

----------Testing malicious input----------
literal_eval() - SAFE (blocks code):
Detected function call - literal: archival_memory_insert(__import__('os').system('echo vulnerable'))
ValueError: malformed node or string on line 1: <ast.Call object at 0x7e7f6c71b450>
Detected function call - literal: archival_memory_insert(__import__('os').system('echo *'))
ValueError: malformed node or string on line 1: <ast.Call object at 0x7e7f6c71b4d0>

Related issues or PRs
#2613

Is your PR over 500 lines of code?
No

Additional context
ast.literal_eval() is much safer than eval() as it only evaluates literals and does not execute arbitrary code. This prevents malicious input from being executed as python code while still allowing the intended functionality of parsing dictionary arguments.
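This literal-only behavior is easy to confirm at the interpreter: literals round-trip, while any call expression, malicious or not, is rejected with ValueError:

```python
import ast

# Literals parse fine:
assert ast.literal_eval("{'key': 'value'}") == {'key': 'value'}
assert ast.literal_eval("[1, 2, 3]") == [1, 2, 3]

# Any call expression is rejected, whether or not it is dangerous:
for src in ("__import__('os').system('id')", "len('x')"):
    try:
        ast.literal_eval(src)
    except ValueError:
        print("rejected:", src)
```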

The files listed in the output above are under the tests folder; for testing purposes our test file is named code_inject.py.
