• Sam Altman and OpenAI are reportedly creating Merge Labs, a startup rival to Elon Musk’s Neuralink

    Merge Labs is a new startup co-founded by Sam Altman, the CEO of OpenAI, that aims to develop brain-computer interface (BCI) technology. This venture is positioned to directly compete with Elon Musk’s Neuralink and other companies like Precision Neuroscience and Synchron, which are working on similar brain interface technologies.

    Here are the key points about Merge Labs:

    • Merge Labs will use artificial intelligence to develop brain implants allowing direct communication between human brains and computers.
    • The name “Merge Labs” refers to a concept Altman introduced in 2017 called “the merge,” describing the merging of human brains and computers.
    • The startup is expected to be valued at around $850 million and will raise a significant portion of its funding from OpenAI’s venture team.
    • Sam Altman is co-founding the company with Alex Blania (co-founder of Worldcoin), though Altman will not personally invest capital.
    • The goal is to create high-bandwidth brain-computer interfaces that could allow people to control computers with their thoughts and potentially lead to a seamless integration of human cognition with AI.
    • This initiative intensifies the competition between Altman and Musk, who previously had ties through OpenAI but have since diverged with competing visions, including Musk’s own company, xAI.
    • Neuralink has already progressed to human trials for quadriplegic patients, while Merge Labs is in the early stages focused on raising funds and assembling a team.

    In summary, Merge Labs represents OpenAI’s strategic move to enter the brain-machine interface market, advancing technologies that connect human brains to digital systems, directly challenging Musk’s Neuralink.

  • OpenAI Introduces Basis: A New Approach to Aligning AI Systems with Human Intent

    OpenAI has unveiled Basis, a novel framework designed to improve how AI systems understand and align with human goals and values. This initiative represents a significant step forward in addressing one of AI’s most persistent challenges: ensuring that advanced models behave in ways that are beneficial, predictable, and aligned with what users actually want.

    The Challenge of AI Alignment: AI alignment is the problem of ensuring that AI systems pursue the objectives their designers intend, without unintended consequences. As models grow more powerful, traditional alignment methods—like reinforcement learning from human feedback (RLHF)—face limitations. Basis seeks to overcome these by creating a more robust, scalable foundation for alignment.

    How Basis Works: Basis introduces several key innovations:

    1. Explicit Representation of Intent
      Unlike previous approaches that infer intent indirectly, Basis structures human preferences in a way that AI can directly reference and reason about. This reduces ambiguity in what the system is supposed to optimize for.
    2. Modular Goal Architecture
      Basis breaks down complex objectives into smaller, verifiable components. This modularity makes it easier to debug and adjust an AI’s behavior without retraining the entire system.
    3. Iterative Refinement via Debate
      The framework incorporates techniques where multiple AI instances “debate” the best interpretation of human intent, surfacing edge cases and improving alignment through structured discussion.
    4. Human-in-the-Loop Oversight
      Basis maintains continuous feedback mechanisms where humans can correct misunderstandings at multiple levels of the system’s decision-making process.
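    The debate and human-in-the-loop mechanisms above are described only at a high level. As a purely hypothetical illustration—every name and the judging logic here are invented for the sketch, and nothing in it reflects OpenAI’s actual API or implementation—an iterative refine-debate-correct loop might be structured like this:

```python
# Hypothetical sketch of "debate + human-in-the-loop" intent refinement.
# All names and logic are invented for illustration, not OpenAI's code.

def propose_interpretations(instruction: str) -> list[str]:
    """Stand-ins for two AI instances proposing readings of the user's intent."""
    return [
        f"{instruction} (literal reading)",
        f"{instruction} (reading that also preserves user data)",
    ]

def debate_round(candidates: list[str], critiques: dict[str, str]) -> str:
    """Toy judge: pick the candidate with the fewest recorded critiques."""
    return min(candidates, key=lambda c: len(critiques.get(c, "")))

def refine_intent(instruction: str, human_feedback) -> str:
    """Iterate debate, letting a human veto misreadings at each step."""
    candidates = propose_interpretations(instruction)
    critiques: dict[str, str] = {}
    for _ in range(3):  # bounded number of refinement rounds
        best = debate_round(candidates, critiques)
        objection = human_feedback(best)
        if objection is None:        # human accepts this interpretation
            return best
        critiques[best] = objection  # record the correction and retry
    return best

# Usage: a human overseer that rejects purely literal readings.
chosen = refine_intent(
    "delete old files",
    lambda c: "too destructive" if "literal" in c else None,
)
```

The point of the sketch is the control flow, not the components: candidate interpretations are explicit objects the system can debate over and a human can veto, rather than intent being inferred implicitly inside one model.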

    Applications and Benefits: The Basis framework enables:

    • More reliable AI assistants that better understand nuanced requests
    • Safer deployment of autonomous systems by making their decision-making more transparent
    • Improved customization for individual users’ needs and preferences
    • Better handling of complex, multi-step tasks without goal misgeneralization

    Technical Implementation: OpenAI implemented Basis by:

    • Developing new training paradigms that separate intent specification from policy learning
    • Creating verification tools to check alignment at different abstraction levels
    • Building infrastructure to efficiently incorporate human feedback during operation

    Early testing shows Basis-equipped systems demonstrate:

    • 40% fewer alignment failures on complex tasks
    • 3x faster correction of misaligned behaviors
    • Better preservation of intended behavior even as models scale

    Future Directions: OpenAI plans to:

    1. Expand Basis to handle multi-agent scenarios
    2. Develop more sophisticated intent representation languages
    3. Create tools for non-experts to specify and adjust AI goals
    4. Integrate Basis approaches into larger-scale models

    Broader Implications: The introduction of Basis represents a philosophical shift in AI development:

    • Moves beyond “black box” alignment approaches
    • Provides a structured way to talk about and improve alignment
    • Creates foundations for more auditable AI systems
    • Could enable safer development of artificial general intelligence

    Availability and Next Steps: While initially deployed in OpenAI’s research environment, the company plans to gradually incorporate Basis techniques into its product offerings. Researchers can access preliminary documentation and experimental implementations through OpenAI’s partnership program.

    Basis marks an important evolution in AI alignment methodology. By providing a more systematic way to encode, verify, and refine human intent in AI systems, OpenAI aims to create models that are not just more powerful but more trustworthy and controllable. This work could prove crucial as AI systems take on increasingly complex roles in society.

  • Claude Sonnet 4 now supports 1M tokens of context

    Anthropic Introduces 1 Million Token Context Window, Revolutionizing Long-Context AI

    Anthropic has announced a groundbreaking advancement in AI capabilities: a 1 million token context window for its Claude models. This milestone dramatically expands the amount of information AI can process in a single interaction, enabling deeper analysis of lengthy documents, complex research, and extended conversations without losing coherence.

    Why a 1M Context Window Matters: Most AI models, including previous versions of Claude, have context limits ranging from 8K to 200K tokens—enough for essays or short books but insufficient for large-scale data analysis. The 1 million token breakthrough (equivalent to ~700,000 words or multiple lengthy novels) unlocks new possibilities:

    • Analyzing entire codebases in one go for software development.
    • Processing lengthy legal/financial documents without splitting them.
    • Maintaining coherent, long-term conversations with AI assistants.
    • Reviewing scientific papers, technical manuals, or entire book series seamlessly.

    Technical Achievements Behind the Breakthrough: Scaling context length is not just about adding memory—it requires overcoming computational complexity, memory management, and attention mechanism challenges. Anthropic’s innovations include:

    1. Efficient Attention Mechanisms – Optimized algorithms reduce the quadratic cost of long sequences.
    2. Memory Management – Smarter caching and retrieval prevent performance degradation.
    3. Training Stability – New techniques ensure the model remains accurate over extended contexts.
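    The quadratic cost mentioned in point 1 is easy to quantify: naive self-attention compares every token with every other, so the attention matrix has one entry per token pair. Growing the context 5× (from 200K to 1M tokens) therefore multiplies that pairwise work by 25, which is why more efficient attention schemes are needed. A back-of-the-envelope illustration (not Anthropic’s implementation):

```python
# Back-of-the-envelope cost of naive self-attention: the attention matrix
# has one entry per token pair, so work grows with the square of length.

def pairwise_attention_entries(context_len: int) -> int:
    return context_len ** 2

old = pairwise_attention_entries(200_000)    # previous 200K-token limit
new = pairwise_attention_entries(1_000_000)  # new 1M-token window
print(new // old)  # 5x longer context costs 25x more pairwise work
```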

    Real-World Applications: The 1M context window enables transformative use cases:

    • Legal & Compliance: Lawyers can upload entire case histories for instant analysis.
    • Academic Research: Scientists can cross-reference hundreds of papers in one query.
    • Enterprise Data: Businesses can analyze years of reports, contracts, and emails in a single session.
    • Creative Writing & Editing: Authors can refine full manuscripts with AI feedback.

    Performance & Accuracy: Unlike earlier models that struggled with “lost-in-the-middle” issues (forgetting mid-context information), Claude’s extended memory maintains strong recall and reasoning across the full 1M tokens. Benchmarks show improved performance in:

    • Needle-in-a-haystack tests (retrieving small details from massive texts).
    • Summarization of long documents with high fidelity.
    • Multi-document question answering without fragmentation.
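    Needle-in-a-haystack evaluations like the one cited above follow a simple recipe: bury a unique fact at a known depth in long filler text, then check whether the model can retrieve it. A minimal harness sketch—here the “model” is a placeholder function that just searches the prompt, standing in for a real API call:

```python
# Minimal needle-in-a-haystack harness: plant a fact at a chosen depth in
# filler text, then check whether the model's answer recovers it.

def build_haystack(needle: str, filler: str, total_chars: int, depth: float) -> str:
    """Repeat filler to total_chars and splice the needle in at `depth` (0..1)."""
    hay = (filler * (total_chars // len(filler) + 1))[:total_chars]
    pos = int(total_chars * depth)
    return hay[:pos] + " " + needle + " " + hay[pos:]

def run_trial(ask_model, needle: str, question: str, depth: float) -> bool:
    prompt = build_haystack(needle, "The sky was gray that morning. ", 10_000, depth)
    answer = ask_model(prompt + "\n\n" + question)
    key = needle.split()[-1].rstrip(".")  # the fact to recover ("7481")
    return key in answer

# Placeholder "model" that searches the prompt, standing in for a real API.
def fake_model(prompt: str) -> str:
    start = prompt.find("magic number is")
    return prompt[start:start + 20] if start != -1 else "unknown"

ok = run_trial(fake_model, "The magic number is 7481.", "What is the magic number?", 0.5)
```

    A real benchmark sweeps `depth` from 0 to 1 and the haystack length up to the full context window, scoring retrieval accuracy at each point.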

    Future Implications: This advancement pushes AI closer to human-like comprehension of vast information. Potential next steps include:

    • Multi-modal long-context (integrating images, tables, and text).
    • Real-time continuous learning for persistent AI memory.
    • Specialized industry models for medicine, law, and engineering.

    Availability & Access: The 1M token feature is rolling out to Claude Pro and Team users, with enterprise solutions for large-scale deployments. Anthropic emphasizes responsible scaling, ensuring safety and reliability even with expanded capabilities.

    Anthropic’s 1 million token context window marks a quantum leap in AI’s ability to process and reason over large datasets. By breaking the context barrier, Claude unlocks new efficiencies in research, business, and creativity—setting a new standard for what AI can achieve.

  • GitHub CEO Thomas Dohmke announced his resignation in August 2025

    GitHub CEO Thomas Dohmke announced his resignation in August 2025. After nearly four years leading GitHub, Dohmke will remain until the end of 2025 to support the transition before leaving to start a new tech startup, returning to his “founder roots.” His departure signals a major restructuring, with Microsoft integrating GitHub fully into its CoreAI team, led by Jay Parikh. Microsoft will not be replacing the CEO role at GitHub, effectively ending GitHub’s independence as an entity and folding it under direct Microsoft AI engineering operations.

    During Dohmke’s tenure, GitHub grew significantly, with over 1 billion repositories and 150 million developers, and saw a doubling of AI-related projects. The transition places GitHub more closely aligned with Microsoft’s AI platform, reflecting the increasing role of AI tools like GitHub Copilot.

    Under the new structure, GitHub management will report to several Microsoft executives: Julia Liuson will oversee revenue, engineering, and support, while Mario Rodriguez will report to Asha Sharma, head of Microsoft’s AI platform products.

    In summary, Dohmke’s resignation marks the end of GitHub’s independent CEO leadership as it becomes integrated into Microsoft’s CoreAI division, while Dohmke moves on to found a new startup.

  • Musk says xAI to take legal action against Apple over App Store rankings

    Elon Musk’s AI company xAI is publicly threatening to take legal action against Apple, accusing the tech giant of anticompetitive behavior related to App Store rankings. Musk alleges that Apple has been manipulating its App Store curation and ranking system to favor OpenAI’s ChatGPT chatbot over xAI’s direct competitor, Grok. According to Musk, this manipulation effectively blocks Grok and potentially other competing AI products from reaching top visibility or prominent placement in the App Store, despite strong user interest and downloads.

    Key points of the dispute include:

    • Musk has accused Apple of a clear antitrust violation by giving preferential treatment through rankings to OpenAI’s ChatGPT, which holds the No. 1 spot in the U.S. App Store, while Grok is lower ranked, around 5th or 6th place.
    • Musk questioned why Apple refuses to feature xAI’s apps (Grok and Musk’s social app X) in the “Must Have” section of the App Store, despite their large user bases.
    • The allegation centers on platform discrimination and abuse of Apple’s market power as the gatekeeper for app distribution on iOS devices.
    • This dispute comes amid Apple’s expanding partnership with OpenAI, integrating ChatGPT deeply into Apple devices, which Musk ties to the alleged favoritism.
    • Musk publicly declared on his social platform X that xAI will take “immediate legal action” if the situation is not addressed.
    • Apple is expected to defend itself by saying that its app rankings are based on neutral criteria like downloads, engagement, and quality control.
    • The conflict is occurring against a broader backdrop of regulatory scrutiny and legal challenges to Apple’s App Store practices globally.
    • OpenAI CEO Sam Altman has responded by accusing Musk of manipulating his own social platform to harm competitors, intensifying the rivalry.

    As of now, no formal lawsuit has been filed, but the public threat of litigation and regulatory interest could escalate the situation quickly. In summary, the dispute centers on Musk’s accusation that Apple unfairly favors OpenAI’s chatbot through its App Store ranking policies, potentially breaching antitrust laws and, he claims, stifling competition for his AI offerings.

  • NVIDIA Showcases Cutting-Edge Physical AI Research at SIGGRAPH 2025

    At SIGGRAPH 2025, NVIDIA highlighted groundbreaking advancements in physics-based AI, demonstrating how artificial intelligence is revolutionizing simulations, robotics, graphics, and scientific computing. The event featured research papers, presentations, and demos emphasizing AI’s role in enhancing real-world physics modeling for applications like autonomous systems, digital twins, and immersive virtual environments.

    Key Research Breakthroughs

    1. Physics-Informed Machine Learning
      NVIDIA researchers presented AI models that integrate physical laws into neural networks, improving accuracy in fluid dynamics, material science, and climate modeling. These models combine deep learning with traditional simulation techniques, enabling faster and more efficient predictions.
    2. AI-Accelerated Robotics
      A major focus was on embodied AI, where robots learn from simulated environments before real-world deployment. NVIDIA’s Isaac Sim platform showcased reinforcement learning agents that master complex tasks—like object manipulation and locomotion—through high-fidelity physics simulations.
    3. Neural Physics for Real-Time Graphics
      New techniques in neural rendering and physics-based animation were unveiled, allowing hyper-realistic virtual worlds to adapt dynamically. AI-driven approaches now simulate cloth, hair, and fluids in real time, benefiting gaming, film VFX, and the metaverse.
    4. Generative AI for 3D Content Creation
      NVIDIA introduced AI tools that generate 3D objects and scenes from text or 2D images, significantly speeding up digital content workflows. These models incorporate physics-based constraints to ensure structural realism.
    5. Digital Twins for Industry & Climate Science
      AI-powered digital twins are being used to model large-scale systems, from factories to weather patterns. NVIDIA’s Earth-2 initiative demonstrated climate simulations enhanced by AI, offering higher resolution and faster predictions.
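    The “physics-informed” idea in point 1 can be made concrete with a toy example: score a candidate model on a loss that penalizes both data misfit and violation of a governing equation. This is the generic physics-informed (PINN-style) recipe, not NVIDIA’s code; the decay ODE dy/dx = -y is chosen here only for illustration:

```python
# Sketch of a physics-informed loss: data misfit plus the violation of a
# governing equation (here dy/dx + y = 0), checked by finite differences.
import math

def physics_residual(f, xs, h=1e-5):
    """Mean squared violation of dy/dx + y = 0 at collocation points xs."""
    total = 0.0
    for x in xs:
        dydx = (f(x + h) - f(x - h)) / (2 * h)  # central difference
        total += (dydx + f(x)) ** 2
    return total / len(xs)

def data_loss(f, data):
    return sum((f(x) - y) ** 2 for x, y in data) / len(data)

def pinn_loss(f, data, xs, lam=1.0):
    """Physics-informed loss = data misfit + lam * equation residual."""
    return data_loss(f, data) + lam * physics_residual(f, xs)

xs = [0.1 * i for i in range(1, 10)]   # collocation points
data = [(0.0, 1.0)]                    # one observed point: y(0) = 1
good = lambda x: math.exp(-x)          # exact solution of the ODE
bad = lambda x: 1.0 - x                # fits the data point but not the physics
print(pinn_loss(good, data, xs) < pinn_loss(bad, data, xs))  # True
```

    Both candidates fit the single observation perfectly; only the physics term distinguishes them, which is exactly the leverage physics-informed training exploits when data is sparse.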

    Industry Impact & Partnerships

    NVIDIA announced collaborations with leading automotive, aerospace, and entertainment companies to deploy these AI technologies. For example:

    • Autonomous Vehicles: AI simulates millions of driving scenarios to improve safety.
    • Manufacturing: Factories use digital twins for predictive maintenance and optimization.
    • Entertainment: Studios leverage AI to automate animation and special effects.

    NVIDIA reaffirmed its commitment to scaling physics-based AI, with plans to integrate these advancements into its Omniverse platform for broader industry adoption. Researchers aim to further bridge the gap between simulation and reality, unlocking new possibilities in science and engineering.

    Overall, SIGGRAPH 2025 underscored NVIDIA’s leadership in merging AI with physics-based computing. By enhancing simulations, robotics, and digital content creation, these innovations are set to transform industries reliant on accurate, real-time modeling of the physical world.

  • Reddit is currently blocking the Internet Archive’s Wayback Machine, which can now only crawl and archive Reddit’s homepage

    Reddit is currently blocking the Internet Archive’s Wayback Machine from indexing most of its content. This means that the Wayback Machine can now only crawl and archive Reddit’s homepage, but it cannot access or archive posts, comments, subreddits, profiles, or detailed content on Reddit.
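    Crawl restrictions of this kind are typically enforced through a site’s robots.txt file. A hypothetical rule set that permits an archival crawler to fetch only the homepage might look like this (illustrative only—the user-agent name and rules are not quoted from Reddit’s actual file):

```
User-agent: archive.org_bot
Allow: /$
Disallow: /
```

    The `$` end-of-path anchor (a widely supported robots.txt extension) makes `Allow: /$` match only the root URL, while `Disallow: /` blocks every other path.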

    The reason behind this move is that AI companies have been using the Wayback Machine to scrape Reddit data without licensing or permission, bypassing Reddit’s rules on data use. Reddit has struck licensing deals with companies like OpenAI and Google to provide access to its data for AI training but wants to prevent unauthorized scraping via archival services. This has led Reddit to close off the free archiving of its site’s content outside of the homepage to protect user privacy, control content ownership, and monetize access.

    This shift marks a big change from earlier policies when Reddit allowed “good faith actors,” such as the Internet Archive, to archive the site freely. Now, Reddit is restricting access until the Internet Archive can ensure compliance with Reddit’s rules, especially concerning user privacy and removed content. This means many Reddit conversations and cultural content may no longer be preserved for posterity through the Wayback Machine.

    In summary, Reddit is restricting the Wayback Machine’s ability to archive its content due to concerns about AI scraping and to protect its data licensing interests, limiting the archive’s scope to the homepage only.

  • GitHub CEO Thomas Dohmke: “Embrace AI or Leave the Profession”. A clear warning that AI is reshaping software development

    GitHub CEO Thomas Dohmke has issued a strong warning to software developers: they must embrace artificial intelligence (AI) or leave the profession. His message reflects how AI is reshaping software development, transforming developers from traditional coders into “AI managers” or “creative directors of code” who guide, prompt, and review AI-generated code rather than manually writing every line themselves.

    Dohmke’s stance is based on an in-depth study by GitHub involving 22 developers who already extensively use AI tools. He predicts that AI could write up to 90% of all code within the next two to five years, making AI proficiency essential for career survival in software engineering. Developers who adapt are shifting to higher-level roles involving system architecture, critical review of AI output, quality control, and prompt engineering. Those who resist this transformation risk becoming obsolete or forced to leave the field.

    • Next 2–5 years: AI tools may write up to 90% of all code
    • By 2030: 90% automation is predicted, with developers urged to upskill amid ethical and competitive challenges

    This evolution entails a fundamental reinvention of the developer role: from manual coding to managing AI systems and focusing on complex design and problem-solving tasks. Dohmke emphasizes that developers should not see AI as a threat but as a collaborative partner that enhances productivity and creativity.

    GitHub’s CEO frames AI adoption not merely as a technological shift but as a critical career imperative, urging the developer community to embrace AI-driven workflows or face obsolescence.

  • Apple’s LLM Technology Boosts Prediction Speed. What is “multi-token prediction” (MTP) framework?

    Apple’s innovation in large language models centers on a “multi-token prediction” (MTP) framework, which enables models to predict multiple tokens simultaneously rather than generating text one token at a time as in traditional autoregressive models. This approach improves inference speed significantly, with reported speedups of 2–3× on general tasks and up to 5× in more predictable domains like coding and math, while maintaining output quality.

    The core of Apple’s MTP framework involves inserting special “mask” tokens into the input prompts. These placeholders allow the model to speculate on several upcoming tokens at once. Each predicted token sequence is then immediately verified against what standard sequential decoding would produce, reverting to single-token prediction if needed to ensure accuracy. This leads to faster text generation without degrading quality, thanks to techniques such as a “gated LoRA adaptation” that balances speculation and verification.
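    The speculate-then-verify loop described here resembles speculative decoding: draft several tokens in one step, keep the longest prefix that one-at-a-time decoding agrees with, and fall back to a single verified token otherwise. A toy sketch with mock model functions (the mask tokens and gated LoRA details are omitted; nothing here is Apple’s actual code):

```python
# Toy speculate-and-verify decode loop: draft k tokens in one step, accept
# the longest prefix that sequential decoding agrees with, else fall back
# to one verified token. Mock functions stand in for the LLM's two heads.

def draft_k_tokens(prefix: list[str], k: int) -> list[str]:
    """Stand-in for the multi-token head: guesses k tokens at once."""
    canned = ["the", "cat", "sat", "down"]  # drafts diverge after 3 tokens
    return canned[len(prefix):len(prefix) + k]

def next_token(prefix: list[str]) -> str:
    """Stand-in for standard one-token-at-a-time decoding (ground truth)."""
    target = ["the", "cat", "sat", "on", "the", "mat"]
    return target[len(prefix)]

def speculative_decode(n_tokens: int, k: int = 3) -> tuple[list[str], int]:
    out: list[str] = []
    steps = 0
    while len(out) < n_tokens:
        drafts = draft_k_tokens(out, k)
        steps += 1
        accepted = 0
        for tok in drafts:  # verify drafts against sequential decoding
            if len(out) < n_tokens and tok == next_token(out):
                out.append(tok)
                accepted += 1
            else:
                break       # first mismatch invalidates the rest
        if accepted == 0:                # speculation failed: emit one
            out.append(next_token(out))  # verified token instead
    return out, steps

tokens, steps = speculative_decode(6)
```

    In this trace the first step accepts three drafted tokens at once, so the six-token output takes fewer decode steps than purely sequential generation, while the verify-or-fall-back rule guarantees the output matches what sequential decoding would have produced.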

    In training, Apple’s method augments input sequences by appending multiple mask tokens corresponding to future tokens to be predicted. The model learns to output these future tokens jointly while preserving its ability to predict the next token normally. This involves a carefully designed attention mechanism that supports parallel prediction while maintaining autoregressive properties. The training process parallelizes what would otherwise be sequential queries, improving training efficiency and strengthening the model’s ability to “think ahead” beyond the immediate next token.

    This innovation addresses the inherent bottleneck in traditional autoregressive models, which generate text sequentially, limiting speed and efficiency. By enabling multi-token simultaneous prediction, Apple’s research unlocks latent multi-token knowledge implicitly present in autoregressive models, essentially teaching them to anticipate multiple future words at once, much like human language planning.

    Overall, Apple’s multi-token prediction framework represents a significant advancement in AI language model inference, promising faster, more efficient generation without sacrificing accuracy—key for real-world applications like chatbots and coding assistants.

  • OpenAI gives $1M+ bonuses to 1,000 employees amid talent war

    OpenAI gave special bonuses exceeding $1 million each to about 1,000 employees on August 7, 2025, as part of its strategy amid intense competition for AI talent. This move came just hours after launching a major product, reflecting the high stakes in the ongoing talent war to secure and retain top AI researchers and engineers.

    In the broader context, this talent war in AI includes massive compensation packages from leading AI and tech companies like Google DeepMind, Meta, and Microsoft, with top researchers receiving offers that can reach tens of millions of dollars annually. OpenAI’s bonuses and compensation packages form part of this competitive landscape, where retaining specialized AI talent is critical due to their immense impact on innovation and company success.

    The median total compensation for OpenAI engineers ranges widely, with some senior engineers earning in excess of $1 million annually, and top researchers receiving over $10 million per year when including stock and bonuses. The $1M+ bonuses to roughly 1,000 employees signify a large-scale, strategic investment by OpenAI to maintain its leadership and workforce stability amid fierce recruiting battles in AI development.

    These large bonuses are a strategic investment by OpenAI, reflecting the high stakes in the AI talent war and its transition to a for-profit model, which allows more flexible, lucrative employee compensation.