Vamshi Krishna Bonagiri victorknox

AI Researcher | PhD @ MBZUAI | CHAI, UC Berkeley | IIIT Hyderabad | Precog

128 followers · 176 following

Abu Dhabi, UAE
https://bonagiri.io

Pinned

SaGE Public

Forked from vnnm404/SaGE

The official repo implementing SaGE: Evaluating Moral Consistency in Large Language Models.

Python

QuittingAgents Public

Code for the paper: Check Yourself Before You Wreck Yourself: Selectively Quitting Improves LLM Agent Safety

Python

sycophancy-detection Public

This project implements activation-based detection approaches from representation engineering to identify when LLMs give sycophantic (user-pleasing rather than truthful) responses.

Python 1

picoVLM Public

Forked from huggingface/nanoVLM

Editing nanoVLM for efficiency

Python

idecir-Towards-Effective-Paraphrasing-for-Information-Disguise Public

Forked from idecir/idecir-Towards-Effective-Paraphrasing-for-Information-Disguise

Python