2 matches found
PoTS: Proof-Of-Training-Steps for Backdoor Detection in Large Language Models
As Large Language Models (LLMs) gain traction across critical domains, ensuring secure and trustworthy training processes has become a major concern. Backdoor attacks, where malicious actors inject hidden triggers into training data, are particularly insidious and difficult to detect. Existing...
Engineering Trustworthy Machine-Learning Operations with Zero-Knowledge Proofs
As Artificial Intelligence (AI) systems, particularly those based on machine learning (ML), become integral to high-stakes applications, their probabilistic and opaque nature poses significant challenges to traditional verification and validation methods. These challenges are exacerbated in regulated...