#open-weight-models tag

ByteDance and DeepSeek Are Placing Very Different AI Bets

DeepSeek focuses on open-weight model capability while ByteDance prioritizes broad integration of AI into devices and apps.

Artificial intelligence

fromInfoWorld

1 day ago

Mistral targets lightweight processors with its biggest open model yet

Enterprises favor open-weight, on-prem models like Mistral for private, customizable, cost-effective internal applications while using proprietary APIs for external-facing, risk-managed services.

Artificial intelligence

fromTechCrunch

2 days ago

Mistral closes in on Big AI rivals with new open-weight frontier and small models | TechCrunch

Mistral released Mistral 3: a 10-model open-weight family with a large multimodal multilingual frontier model and nine smaller offline-capable, customizable models.

Artificial intelligence

fromNature

4 days ago

China wants to lead the world on AI regulation - will the plan work?

Global AI regulation remains unresolved; China proposes WAICO and emphasizes open-weight models and economic-growth applications while the US favors deregulation.

#openai

fromWIRED

3 weeks ago

US politics

OpenAI's Open-Weight Models Are Coming to the US Military

fromBusiness Insider

3 months ago

Artificial intelligence

Sam Altman says there was a big reason OpenAI released its open-weight models

fromWIRED

3 weeks ago

US politics

OpenAI's Open-Weight Models Are Coming to the US Military

fromBusiness Insider

3 months ago

Artificial intelligence

Sam Altman says there was a big reason OpenAI released its open-weight models

more#openai

fromIT Pro

4 weeks ago

Some of the most popular open weight AI models show 'profound susceptibility' to jailbreak techniques

A host of leading open weight AI models contain serious security vulnerabilities, according to researchers at Cisco. In a new, researchers found these models, which are publicly available and can be downloaded and modified by users based on individual needs, displayed "profound susceptibility to adversarial manipulation" techniques. Cisco evaluated models by a range of firms including: Alibaba (Qwen3-32B) DeepSeek (v3.1) Google (Gemma 3-1B-IT) Meta (Llama 3.3-70B-Instruct) Microsoft (Phi-4) OpenAI (GPT-OSS-20b) Mistral (Large-2).

Artificial intelligence

fromNature

1 month ago

Customizable AI systems that anyone can adapt bring big opportunities - and even bigger risks

Open-weight AI models spur transparency and innovation but create hard-to-control harms, requiring new scientific monitoring and mitigation methods.

fromApp Developer Magazine

11 months ago

OpenAI open weight models released for optimized laptop performance

OpenAI has released two open-weight language models designed to operate efficiently on laptops and personal computers. These models are intended to provide advanced reasoning capabilities while allowing developers greater flexibility through local deployment and fine-tuning. Unlike proprietary models, open-weight models provide public access to trained parameters, enabling developers to adapt the models for specific tasks without access to the original training datasets. This approach improves control over AI applications and supports secure, local usage in environments with sensitive data.

Artificial intelligence

fromIT Pro

2 months ago

DeepSeek's R1 model training costs pour cold water on big tech's massive AI spending

DeepSeek trained its R1 reasoning model for about $294,000 using 512 Nvidia H800 chips, plus ~$6M for its base LLM.

fromwww.nature.com

2 months ago

Secrets of DeepSeek AI Model Revealed in Landmark Paper

The success of DeepSeek's powerful artificial intelligence (AI) model R1 that made the US stock market plummet when it was released in January did not hinge on being trained on the output of its rivals, researchers at the Chinese firm have said. The statement came in documents released alongside a peer-reviewed version of the R1 model, published today in Nature.

Artificial intelligence

#open-source-ai

fromNextgov.com

3 months ago