There are no reviews yet. Be the first to send feedback to the community and the maintainers!
LM-exp
LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spacesCAA
Steering Llama 2 with Contrastive Activation AdditionInfluenceFunctions
Implementation of Influence Function approximations for differently sized ML models, using PyTorchActivationDirectionAnalysis
EmbedMap
Generate a 3D map of links based on their embeddings using OpenAI's embedding APIdevinterp
Quantifying degeneracy in toy modelsmlexperiments
Exploratory work analyzing loss landscapes of neural nets / steering between generalizations / phase transitionsBehaviorEvals
Scripts for evaluating LLMsESP3D-WEBUI
Web App for Paige Braille display and PCBFeedbackr
Easily collect yes/no feedback on language model outputs from humansLove Open Source and this site? Check out how you can help us