Safeguarding AI: Effectiveness of Guardrails in Controlling Malicious Output from Locally Hosted LLMs

This paper explores the effectiveness of open-source guardrails that can be added to LLM-based conversational applications to mitigate the threat of potential misuse.
By
Jared McWherter
August 21, 2024

All papers are copyrighted. No re-posting of papers is permitted

470x382_Research_Paper_gray.jpg