Introducing Align Evals: Streamlining LLM Application Evaluation
Evaluations are a key technique for improving your application — whether you’re working on a single prompt or a complex agent. Iterating on evaluators has...
Keep external systems in sync with scheduled exports
You can now schedule automatic exports of your LangSmith traces, without needing to set up your own infrastructure. Whether you're syncing to a data...
View agent deployment metrics in LangSmith
You can now monitor your agent deployments directly from the LangSmith UI. View your deployment’s CPU & memory usage, API request latency, pending/active run...
Set expiration dates on LangSmith API keys!
You can now set expiration dates on LangSmith API keys! Adding expiration helps you: Learn how to set and manage API key expirations in our documentation.
Create custom views when viewing evaluation results
You can now create custom views to highlight important information when viewing evaluation results in LangSmith: Docs:...
Run Agent Evals in LangSmith Studio
Get started with evaluating your agent directly from the UI — you can now run evaluations over your agent in LangSmith Studio with no code required! Test how...
Use built-in tools in LangSmith's Playground!
You can now call built-in tools from OpenAI and Anthropic directly from within the LangSmith Playground. Pre-configured tools, such as web search and MCP,...
Track costs across multi-modal inputs and token-caching
Cost tracking is especially important for agentic applications because resource usage is determined dynamically by the agent itself. Our new cost-tracking...
Integrate LangSmith prompts with your SDLC
You can already test, version, and collaborate on prompts in LangSmith. Now, you can automatically sync those prompts to GitHub, external databases, or CI/CD...
Monthly usage charts for easy tracking in LangSmith
We’ve rolled out monthly usage charts for SaaS customers in LangSmith. You can now track all billable metrics (as defined in Metronome) in one place on the...