22 commits
f356c23
Add server-specific optimization with automatic server detection and …
petersonbill64 Jul 17, 2025
fc0c7de
initial step - needs revision - LLMAsJudge to evaluate the successful…
petersonbill64 Jul 18, 2025
9010396
LLM Judge Successful - needs revision
petersonbill64 Jul 21, 2025
8a07e29
include LLM-as-Judge task success evaluation with scoring criteria an…
petersonbill64 Jul 22, 2025
67351ff
Merge branch 'main' into mahtab_prompt_optimize
petersonbill64 Jul 22, 2025
451c5d1
Merge branch 'main' into mahtab_prompt_optimize
petersonbill64 Jul 28, 2025
ec51b2f
- Fixing the truncated message - using llm to summarize the long message
petersonbill64 Jul 29, 2025
7d80afd
.
petersonbill64 Jul 29, 2025
19fef96
Merge branch 'main' into mahtab_prompt_optimize
petersonbill64 Jul 29, 2025
fd65079
- inclusion of server name
petersonbill64 Jul 30, 2025
cf2d0e9
- Enhanced trace processing
petersonbill64 Aug 1, 2025
70be221
Merge branch 'main' into mahtab_prompt_optimize
petersonbill64 Aug 4, 2025
6f0cbb4
Enhance trace processing and dataset extraction
petersonbill64 Aug 5, 2025
ae011bb
minor changes
petersonbill64 Aug 5, 2025
b840bd1
minor changes
petersonbill64 Aug 5, 2025
5ba742e
Merge branch 'main' into mahtab_prompt_optimize
petersonbill64 Aug 8, 2025
2a37e8d
.
petersonbill64 Aug 13, 2025
5d9ffb5
Merge branch 'main' into mahtab_prompt_optimize
petersonbill64 Aug 13, 2025
29c8fbc
Merge branch 'main' into mahtab_prompt_optimize
petersonbill64 Aug 15, 2025
9c6e99b
improved prompt for tool docstring optimizer
petersonbill64 Aug 15, 2025
52de158
- OPTIM_PROCESS.md posted
petersonbill64 Aug 18, 2025
0436604
Merge branch 'main' into mahtab_prompt_optimize
petersonbill64 Aug 25, 2025
3 changes: 2 additions & 1 deletion README.md
@@ -25,4 +25,5 @@ MCP-Eval is a developer-first testing framework for Model Context Protocol (MCP)
Install `mcp_eval` and its dependencies. Make sure `mcp-agent` is also installed in your environment.

```bash
pip install "typer[all]" rich pydantic jinja2
pip install "typer[all]" rich pydantic jinja2
```
4 changes: 3 additions & 1 deletion examples/mcp_server_fetch/tests/test_pytest_style.py
@@ -23,7 +23,9 @@ async def test_basic_fetch_with_pytest(mcp_agent: TestAgent):
    response = await mcp_agent.generate_str(
        "Fetch the content from https://example.com"
    )

    response = await mcp_agent.generate_str(
        "Fetch the content from https://bbc.com"
    )
    # Modern evaluator approach
    await mcp_agent.session.assert_that(
        Expect.tools.was_called("fetch"), name="fetch_tool_called", response=response