Artificial intelligence
from Techzine Global, 3 weeks ago
Safety mechanisms of AI models more fragile than expected
A single unlabeled training prompt can undermine safety alignment in large language models.