Salesforce makes 'flight simulator' for AI agents as 95% of enterprise pilot fails to reach production

Want smart insight into your inbox? Enterprise AI, only what matters to data and security leaders, sign up for our weekly newspapers. Subscribe now

Sales force Conditions that rigorous test enterprise in simulated commercial environment will solve one of the biggest problems of artificial intelligence: agents who work in demonstrations but fail into dirty reality of corporate operations.

Cloud software giant unveiled three major AI research initiatives this week, including Crimena-ProWhat does it say “Digital twins“Hundreds of salesforce customer examples compromised after recent violations with comprehensive AI pilot failures and fresh safety concerns as enterprises to operate business where AI agents can be tasted before deployment.

“Pilots do not learn to fly in a storm; they train in flight simulators who push them to prepare the most extreme challenges,” said Silvio Savaris, chief scientist of salesfors and head of AI Research during a press conference. “Similarly, AI agents benefit from simulation testing and training, preparing them to handle the unexpectedness of daily business scenarios before their deployment.”

Research push shows the growing enterprise despair with AI implementation. A recent MIT report found that 95% of generative AI pilots in companies are failing to reach out to production, while their studies by salesfors show that large language models alone achieve only 35% success rates in complex commercial scenarios.

AI scaling hits its boundaries

Power caps, rising token costs, and entrance delays are re -shaping Enterprise AI. Join our exclusive salons to learn about top teams:

Transform energy into a strategic profit

Architecting efficient estimates for real thrruput benefits

Unlocking competitive ROI with sustainable AI system

Secure your location to stay ahead,

Digital twins for Enterprise AI: How Salesforce simulates real business chaos

Crimena-Pro The AI represents salesforce’s effort to bridge the gap between promise and performance. Unlike the existing benchmarks testing generic abilities, platforms evaluate agents on actual venture functions in the customer service escalation, sales forecast and supply chain using synthetic but realistic business data.

“If synthetic data is not cautious, it may be about misleading or optimistic consequences how well your agent performs in your real environment,” Jason WuA research manager at the salesfors who led the Krarena-Prote development.

The platform operates within the actual cellsforce production environment instead of the toy setup, using data valid by domain experts with relevant commercial experience. It supports both the business-to-business and business-to-consumer scenarios and can imitrse the multi-turn conversations catching real conversion dynamics.

The salesforce is using itself as “customer zero” to test these innovations internal. “Before we bring anything to the market, we will innovate in our team’s hands to test it out,” Muralidhar krishnaprasadChairman of Salesforce and CTO, during the press conference.

Five matrix that determine if your AI agent is enterprise-redress

Cellsforce introduced with simulation environment Agent benchmark for CRMFive important enterprises designed to evaluate AI agents in metrics: accuracy, cost, speed, trust and safety, and environmental stability.

Stability metric is particularly notable, with companies to align the size of the model with work complexity to reduce environmental impact while maintaining performance. “By cutting through the model overload noise, the benchmark gives a clear, data-driven way to connect the right models with the right agents,” the company said.

The benchmarking effort addresses a practical challenge before IT leaders: almost daily released with the new AI model, determining which people are suitable for specific business applications, have become rapidly difficult.

Why dirty enterprise data can create or break your AI finance

The third initiative focuses on a fundamental condition for reliable AI: clean, integrated data. Salesfors Account matching The capacity automatically uses fine-tuned language models to identify and integrate duplicate records in the system, identifying that “example company, ink.” And “example company” represent the same unit.

Data consolidation work emerged from a partnership between the research and product teams of the salesforce. Krishnaprasad said, “What identity resolve in data cloud is essentially, if you think of anything simple as a user, they have many, many, many, many IDs in many systems within any company,” Krishnaprasad explained.

A major cloud provider customer achieved a 95% match rate using technology, manually saving the cross-reference to the vendors to identify accounts and saved 30 minutes per connection by eliminating the requirement of several screens.

Earlier this month, announcements were made amid security concerns after a data theft campaign affecting 700 salesforce customer organizations. According to Google’s Threat Intelligence Group, Hackers exploited ougle tokens Salesloft’s flow from chat agent to reach salesforce examples and steal credentials for Amazon web services, snowflakes and other platforms.

Breach highlights the weaknesses in third-party integration depends on the venture AI-managed customer engagement. Salesforce has since Removed salesloft drift From the pending investigation from your appexache market.

The gap between AI Demo and Enterprise reality is bigger than you think

Simulation and benchmarking initiative indicates a comprehensive recognition that enterprise AI sinners require more than effective performance videos. The actual professional environment has heritage software, inconsistent data format and complex workflows that can also derail the sophisticated AI system.

“The main aspect that we want today we are discussing is continuity aspect, so how to ensure that we go into a kind of unsatisfactory performance from them, if you just plug an LM in cases of an enterprise use, which receives a lot of performance,” Savers said during the press conference.

The approach of salesfors emphasizes the need to work firmly in various scenarios rather than excellent performance in narrow functions. The concept of the company “Enterprise general intelligence“(EGI) focuses on building agents that are capable and consistent to perform complex business functions.

Like enterprises continue to invest in AI technologies, such as success of platforms Crimena-Pro It can determine whether the current wave of AI enthusiasm turns into permanent trade change or another example of technology promise than practical distribution.

Research initiative will be displayed Salesforce’s Dreamforce Conference in OctoberWhere the company is expected to declare additional AI development as it rapidly tries to maintain its leadership status in the AI market.

Daily insights on business use cases with VB daily

If you want to impress your boss, VB daily has covered you. We give you the scoop inside what companies are doing with generative AI, from regulatory changes to practical deployment, so you can share insight for maximum ROI.

Read our privacy policy

Thanks for membership. See more VB newsletters here.

There was an error.

What's Hot

Amazon Milwauki is selling 9 -tool kit from this Labor Day for $ 200 – what do you get here

Report: Samsung’s three-folded phones, XR headsets, and AI Smart Glasses should be revealed at the unpacked event on 29 September.

These XR glasses gave me a 200 -inch screen to work – and it is difficult to defeat the price

Amazon Milwauki is selling 9 -tool kit from this Labor Day for $ 200 – what do you get here

These XR glasses gave me a 200 -inch screen to work – and it is difficult to defeat the price

Tech founders should embrace market needs

Microsoft’s new text editor is a VIM and Nano option

The best luxury car for buyers for the first time in 2025

Massives Datenleck in Cloud-Spichenn | CSO online

Most Popular

10,000 steps or Japanese walk? We ask experts if you should walk ahead or fast

FIFA Club World Cup Soccer: Stream Palmirus vs. Porto lives from anywhere

What do chatbott is careful about punctuation? I tested it with chat, Gemini and Cloud

Our Picks

Amazon Milwauki is selling 9 -tool kit from this Labor Day for $ 200 – what do you get here

Report: Samsung’s three-folded phones, XR headsets, and AI Smart Glasses should be revealed at the unpacked event on 29 September.

These XR glasses gave me a 200 -inch screen to work – and it is difficult to defeat the price

Subscribe to Updates

What's Hot

Salesforce makes ‘flight simulator’ for AI agents as 95% of enterprise pilot fails to reach production

Digital twins for Enterprise AI: How Salesforce simulates real business chaos

Five matrix that determine if your AI agent is enterprise-redress

Why dirty enterprise data can create or break your AI finance

The gap between AI Demo and Enterprise reality is bigger than you think

Related Posts

Subscribe to Updates