Artificial intelligence, from The Register, 1 week ago: OpenAI gpt-oss LLMs use MXFP4: smaller, faster, cheaper. MXFP4 is a 4-bit floating point data type defined by the Open Compute Project, allowing massive compute savings compared to traditional data types used by LLMs.
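To make the "4-bit" claim concrete, here is a minimal Python sketch of block quantization in the MXFP4 style. It assumes the layout described in the Open Compute Project Microscaling (MX) specification: blocks of 32 elements, each stored as a 4-bit E2M1 value, sharing one 8-bit power-of-two scale. The function name `quantize_block` and the tie-breaking rule are illustrative choices, not taken from the article or from any library, and the sketch returns dequantized floats rather than packed bits.

```python
import math

# Magnitudes representable by a 4-bit E2M1 element
# (1 sign bit, 2 exponent bits, 1 mantissa bit).
E2M1_VALUES = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]
E2M1_MAX_EXP = 2   # largest power of two in E2M1 is 4 = 2**2
BLOCK_SIZE = 32    # elements that share one scale

def quantize_block(block):
    """Return (shared_exponent, dequantized_values) for one block of floats.

    Hypothetical helper for illustration only.
    """
    amax = max(abs(x) for x in block)
    if amax == 0.0:
        return 0, [0.0] * len(block)
    # Power-of-two scale that places the largest magnitude near the top
    # of the E2M1 range.
    shared_exp = math.floor(math.log2(amax)) - E2M1_MAX_EXP
    scale = 2.0 ** shared_exp
    out = []
    for x in block:
        # Saturate at the largest E2M1 magnitude, then snap to the nearest
        # representable value (ties go to the smaller magnitude here, a
        # simplification of the rounding a real implementation would use).
        mag = min(abs(x) / scale, E2M1_VALUES[-1])
        nearest = min(E2M1_VALUES, key=lambda v: abs(v - mag))
        out.append(math.copysign(nearest, x) * scale)
    return shared_exp, out

# Storage cost per element: 32 four-bit values plus one 8-bit scale.
BITS_PER_ELEMENT = (BLOCK_SIZE * 4 + 8) / BLOCK_SIZE  # 4.25
```

Under these assumptions each weight costs 4.25 bits on average, against 16 bits for BF16 or FP16, which is where the storage savings in the headline come from.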
Artificial intelligence, from Hackernoon, 1 year ago: Multi-Token Prediction: Mastering Algorithmic Reasoning with Enhanced Resource Use. Multi-token prediction in training language models allows for efficient resource allocation based on token prediction difficulty.
From Hackernoon, 1 year ago: Mutation Testing with GPT and CodeLlama. Mutation testing enhances software reliability by identifying weaknesses in test suites through strategic code mutations.
From Hackernoon, 9 months ago: Battle of the Algorithms: Why SGRLD Beats the Competition in GP Inference. The SGRLD method significantly improves the estimation of spatial covariance parameters in large datasets compared to traditional Bayesian methods.
Artificial intelligence, from WIRED, 3 months ago: Google DeepMind's AI Agent Dreams Up Algorithms Beyond Human Expertise. AlphaEvolve demonstrates that AI models can generate novel and efficient algorithms that surpass human capabilities in specific tasks.