
Teaching Trust: How Small AI Models Can Make Larger Systems More Reliable
As Gen AI technology continues to rapidly evolve and LLMs are integrated into more and more applications, questions of trustworthiness and ethical alignment become increasingly crucial. In the recent study “Generalizing Trust: Weak-to-Strong Trustworthiness in Language Models,” authors Martin Pawelczyk, postdoctoral researcher at Harvard working on trustworthy AI; Lillian Sun, undergraduate student at Harvard studying […]