Close Menu
Pineapples Update –Pineapples Update –

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    What is MicroSD Express? Everything You Need To Know

    June 8, 2025

    5 to avoid pressure washing mistakes

    June 8, 2025

    Spain vs Portugal Live Stream: How to see the Rashtra League Final 2025 from anywhere and for free

    June 8, 2025
    Facebook X (Twitter) Instagram
    Facebook X (Twitter) Instagram Pinterest Vimeo
    Pineapples Update –Pineapples Update –
    • Home
    • Gaming
    • Gadgets
    • Startups
    • Security
    • How-To
    • AI/ML
    • Apps
    • Web3
    Pineapples Update –Pineapples Update –
    Home»Security»Cloud 4 benchmark reforms, but the reference is still 200K
    Security

    Cloud 4 benchmark reforms, but the reference is still 200K

    PineapplesUpdateBy PineapplesUpdateMay 23, 2025No Comments2 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    Cloud 4 benchmark reforms, but the reference is still 200K
    Share
    Facebook Twitter LinkedIn Pinterest Email

    Cloud 4 benchmark reforms, but the reference is still 200K

    Today, Openai rival Anthropic announced the Cloud 4 model, which is much better than Cloud 3 in the benchmark, but we are disappointed with the same 200,000 reference window limits.

    In a blog post, Anthropic stated that Cloud Oppus is the most powerful model of the company, and it is also the best model for coding in the industry.

    Cloud 4

    For example, SWE-Bench (SWE Software is small for engineering benchmark), Cloud Ops 4 scored 72.5 percent and 43.2 on the terminal-bench.

    “It provides continuous performance on long -running tasks, which requires focused efforts and thousands of stages, with the ability to work continuously for several hours, dramatically improved all sonnet models better better and can complete AI agents,” anthropic noted,

    While the benchmark placed Cloud 4 Sonnet and Oppus in its predecessors and competitors such as Gemini 2.5 Pro coding, we are still concerned about the 200,000 reference window range of the model.

    Cloud benchmark

    This may be one of the reasons why Cloud 4 models in these benchmarks coding and complex-solving functions are excels, as these models are not being tested against a large reference.

    For comparison, the Gemini 2.5 Pro ship of Google with 1 million token reference window and support for 2 million reference windows is also in work.

    4.1 models of Chatgpt also provide up to a million reference window.




    Sample Description Input Prompt Caching Wright Prompt Caching Reed Production Reference window Batch resource discount
    Cloud Opus 4 The most intelligent model for complex tasks $ 15 / mtok $ 18.75 / Mtok $ 1.50 / mtok $ 75 / Mtok 200k 50% discount with batch processing
    Cloud sonnet 4 Optimal balance of intelligence, cost and speed $ 3 / mtok $ 3.75 / mtok $ 0.30 / Mtok $ 15 / mtok 200k 50% discount with batch processing

    Cloud is still behind the competition when it comes to the reference window, which is important in large projects.


    Red Report 2025

    Based on the analysis of 14M malicious tasks, search for the top 10 MITERAT & CK techniques behind the 93% attacks and how to defend them against them.

    200K benchmark cloud reference reforms
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticleEnterprise Communication Evolution: Avaya’s infinity platform bridges the gap between today and is expected tomorrow
    Next Article A new look, apple intelligence and more
    PineapplesUpdate
    • Website

    Related Posts

    AI/ML

    AI working is a rapid network case, the latest benchmark test show

    June 8, 2025
    Security

    Remove project directors presented as malicious NPM package utilities

    June 8, 2025
    Security

    Supply series attacks Glustac NPM package with 960K weekly download

    June 7, 2025
    Add A Comment
    Leave A Reply Cancel Reply

    Top Posts

    Microsoft’s new text editor is a VIM and Nano option

    May 19, 2025594 Views

    The best luxury car for buyers for the first time in 2025

    May 19, 2025536 Views

    Massives Datenleck in Cloud-Spichenn | CSO online

    May 19, 2025465 Views
    Stay In Touch
    • Facebook
    • YouTube
    • TikTok
    • WhatsApp
    • Twitter
    • Instagram
    Latest Reviews

    Subscribe to Updates

    Get the latest tech news from FooBar about tech, design and biz.

    Most Popular

    Meta delay entrusts ‘Bhamoth’ AI model, Openi and Google more than one more head start

    May 16, 20250 Views

    The OURA ring found a new rival with just one titanium design and 24/7 biometric tracking – no membership is required

    May 16, 20250 Views

    Filecoin, Lockheed Martin Test IPFS in space

    May 16, 20250 Views
    Our Picks

    What is MicroSD Express? Everything You Need To Know

    June 8, 2025

    5 to avoid pressure washing mistakes

    June 8, 2025

    Spain vs Portugal Live Stream: How to see the Rashtra League Final 2025 from anywhere and for free

    June 8, 2025

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Facebook X (Twitter) Instagram Pinterest
    • About Us
    • Contact Us
    • Privacy Policy
    • Terms And Conditions
    • Disclaimer
    © 2025 PineapplesUpdate. Designed by Pro.

    Type above and press Enter to search. Press Esc to cancel.