fromHackernoon1 year agoArtificial intelligenceThe Link Between Concept Frequency and AI Performance, Seen Through Images and Words | HackerNoon
fromMedium1 month agoArtificial intelligenceTwo Indispensable Tools for Measuring the Quality of AI Systems
fromHackernoon1 year agoOnline learningEnhancing Rhetorical Role Labeling with Training-Time Neighborhood Learning | HackerNoon
fromHackernoon4 months agoPrivacy professionalsThe Impact of Parameters on LLM Performance | HackerNoon
fromHackernoon1 year agoArtificial intelligenceThe Link Between Concept Frequency and AI Performance, Seen Through Images and Words | HackerNoon
fromMedium1 month agoArtificial intelligenceTwo Indispensable Tools for Measuring the Quality of AI Systems
fromHackernoon1 year agoOnline learningEnhancing Rhetorical Role Labeling with Training-Time Neighborhood Learning | HackerNoon
fromHackernoon4 months agoPrivacy professionalsThe Impact of Parameters on LLM Performance | HackerNoon
Artificial intelligencefromHackernoon1 year agoHow Concept Frequency Affects AI Image Accuracy | HackerNoonConcept frequency affects the zero-shot performance of models, with high frequency leading to variable scaling trends.
Artificial intelligencefromHackernoon1 year agoHow Dataset Diversity Impacts AI Model Performance | HackerNoonPretraining data diversity significantly influences model performance, particularly in generalization and predictive capabilities.
Artificial intelligencefromHackernoon1 month agoQDyLoRA in Action: Method, Benchmarks, and Why It Outperforms QLoRA | HackerNoonQuantized DyLoRA achieves superior performance in model fine-tuning tasks compared to previous techniques.
fromHackernoon6 months agoContextualizing SUTRA: Advancements in Multilingual & Efficient LLMs | HackerNoonAdvancements in Large Language Models emphasize the importance of multilingual support to address global linguistic diversity.
fromHackernoon1 year agoArtificial intelligenceAI Still Can't Explain a Joke-or a Metaphor-Like a Human Can | HackerNoon
Artificial intelligencefromInfoWorld3 months agoVector Institute aims to clear up confusion about AI model performanceDeepSeek and OpenAI's o1 models excel in performance, yet AI models still face significant challenges across various tasks.
fromHackernoon1 year agoArtificial intelligenceAI Still Can't Explain a Joke-or a Metaphor-Like a Human Can | HackerNoon
Artificial intelligencefromInfoWorld3 months agoVector Institute aims to clear up confusion about AI model performanceDeepSeek and OpenAI's o1 models excel in performance, yet AI models still face significant challenges across various tasks.
fromInfoQ1 month agoArtificial intelligenceMistral AI Releases Magistral, Its First Reasoning-Focused Language Model
fromBusiness Insider3 months agoArtificial intelligenceMeta's chief AI scientist says scaling AI won't make it smarter
fromComputerworld3 months agoArtificial intelligenceOpen AI's new models hallucinate more than the old onesAI models increasingly produce hallucinations, with newer versions being more prone to inaccuracies.
fromHackernoon3 months agoArtificial intelligenceReconstruction Evaluations Across Varying Amounts of Training Data: Mindeye2 | HackerNoonModel performance improves with increased training data, particularly in specialized contexts such as medical AI.
fromInfoQ1 month agoArtificial intelligenceMistral AI Releases Magistral, Its First Reasoning-Focused Language Model
Artificial intelligencefromBusiness Insider3 months agoMeta's chief AI scientist says scaling AI won't make it smarterYann LeCun argues against the belief that larger AI models always lead to smarter AI, highlighting the need for different approaches.
Artificial intelligencefromComputerworld3 months agoOpen AI's new models hallucinate more than the old onesAI models increasingly produce hallucinations, with newer versions being more prone to inaccuracies.
fromHackernoon3 months agoArtificial intelligenceReconstruction Evaluations Across Varying Amounts of Training Data: Mindeye2 | HackerNoon
Artificial intelligencefromTechCrunch1 month agoDeepSeek may have used Google's Gemini to train its latest model | TechCrunchDeepSeek's R1 model may have been trained on outputs from Google's Gemini, raising ethical concerns regarding data sourcing.
ScalafromHackernoon9 months agoWhat Makes Code LLMs Accurate? | HackerNoonPass@1 rates for Lua programming tasks show that quantization level impacts model performance, particularly affecting lower bit models.
fromHackernoon9 months agoScalaDo Smaller, Full-Precision Models Outperform Quantized Code Models? | HackerNoon
fromHackernoon9 months agoScalaDo Smaller, Full-Precision Models Outperform Quantized Code Models? | HackerNoon
fromHackernoon9 months agoThe V-Shaped Mystery of Inference Time in Low-Bit Code Models | HackerNoonHigher precision results in longer inference times, especially for incorrect solutions.Longer inference times do not guarantee improved performance across different models.
Online learningfromHackernoon1 year agoFine-tuned GPT-3.5 Performance for Explanatory Feedback | HackerNoonFine-tuning GPT-3.5 enhances its ability to identify praise in tutoring responses even with limited data.
Artificial intelligencefromHackernoon3 months agoHow LightCap Sees and Speaks: Mobile Magic in Just 188ms Per Image | HackerNoonLightCap model achieves real-time image processing on mobile devices, meeting efficiency demands for practical applications.
Software developmentfromInfoQ2 months agoWindsurf Launches SWE-1 Family of Models for Software EngineeringWindsurf's SWE-1 models support diverse software engineering tasks while improving performance and user experience.
fromHackernoon7 months agoWhere Glitch Tokens Hide: Common Patterns in LLM Tokenizer Vocabularies | HackerNoonThe study identifies a pattern of untrained tokens across various model families, revealing inefficiencies in tokenizer design.
Artificial intelligencefromTechCrunch3 months agoChatGPT: Everything you need to know about the AI chatbotOpenAI's ongoing development focuses on an open AI model, aiming to enhance accessibility and user engagement.
Artificial intelligencefromFuturism3 months agoOpenAI's Hot New AI Has an Embarrassing ProblemOpenAI's new models o3 and o4-mini show increased tendency to hallucinate, reversing a positive trend in model development.
Artificial intelligencefromTechzine Global3 months agoOpenAI launches o3 and o4-miniOpenAI's new models o3 and o4-mini enhance reasoning capabilities, offering greater efficiency and performance.
Artificial intelligencefromNature3 months agoAI race in 2025 is tighter than ever beforeThe AI competition is intensifying, with Chinese models challenging US leadership and performance gaps narrowing between top AI models.
Artificial intelligencefromHackernoon3 months agoWhy Smaller AI Models Are the Future of Domain-Specific NLP | HackerNoonSmaller, fine-tuned models outperform larger models for specific tasks in biomedical information retrieval.
fromTechCrunch4 months agoResearchers say they've discovered a new method of 'scaling up' AI, but there's reason to be skeptical | TechCrunchExperts are skeptical of the newly proposed AI scaling law called 'inference-time search' despite its potential for improving model performance.
ScalafromHackernoon4 months agoThe Future of AI Compression: Smarter Quantization Strategies | HackerNoonImpact-based parameter selection outperforms magnitude-based criteria in improving quantization for language models.
Artificial intelligencefromFuturism5 months agoOpenAI May Have Really Screwed Up With GPT-4.5OpenAI's GPT-4.5 is perceived as underwhelming despite claims of being the 'largest and most knowledgeable model'.High costs and slow performance contribute to skepticism regarding GPT-4.5's value.