TruEra, a vendor providing tools to test, ...
Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models ...
Enter large language model (LLM) evaluation. The purpose of LLM evaluation is to analyze and refine GenAI outputs to improve their accuracy and reliability while avoiding bias. The evaluation process ...
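The evaluation process described above can be sketched as a simple exact-match accuracy loop. This is a minimal illustration only, not any vendor's implementation: `ask_model` is a hypothetical stand-in for a real model call (an API client or local model), and the stub answers exist purely so the sketch runs.

```python
# Minimal sketch of an LLM accuracy eval loop (illustrative only).
# `ask_model` is a hypothetical stand-in for a real model call;
# in practice it would wrap an API client or a local model.

def ask_model(prompt: str) -> str:
    # Stub model: returns canned answers so the loop is runnable.
    canned = {"2+2=": "4", "capital of France?": "Paris"}
    return canned.get(prompt, "unknown")

def evaluate(examples):
    """Score (prompt, expected) pairs by exact match; return accuracy."""
    correct = sum(
        1
        for prompt, expected in examples
        if ask_model(prompt).strip().lower() == expected.strip().lower()
    )
    return correct / len(examples)

examples = [("2+2=", "4"), ("capital of France?", "Paris")]
print(evaluate(examples))  # fraction of answers matching the reference
```

Real eval harnesses replace exact match with richer scorers (semantic similarity, model-graded rubrics, bias probes), but the shape — dataset in, model answers out, scores aggregated — is the same.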
Gentrace, a developer platform for testing and monitoring artificial intelligence applications, said today it has raised $8 million in an early-stage funding round led by Matrix Partners to expand ...
The rapid adoption of Large Language Models (LLMs) is transforming how SaaS platforms and enterprise applications operate.
Patronus AI Inc., a startup helping companies detect and fix reliability issues in their large language models, today announced that it has closed a $17 million investment. Notable Capital led the ...
As a QA leader, you can check many practical items, each with its own success test. The following list outlines what you need to know: • Source Hygiene: Content needs to come from trusted ...
When we start thinking about Generative AI, two things come to mind: one is the GenAI model itself, with its countless possibilities; the other is the application, with definitive ...