Posts by Tags

Understanding LLM Reasoning Through Meaning-Removed Steering Vectors

3 minute read

Published: December 01, 2024

Large Language Models (LLMs) have shown remarkable capabilities in reasoning tasks, but understanding and controlling their internal reasoning processes remains a significant challenge. In my ongoing research at the MINE Lab (University of Notre Dame), I’m working on a novel approach to this problem: meaning-removed steering vectors.

Dan (Hojun) Yoo

interpretability

Understanding LLM Reasoning Through Meaning-Removed Steering Vectors

llm reasoning

Understanding LLM Reasoning Through Meaning-Removed Steering Vectors

machine learning

Understanding LLM Reasoning Through Meaning-Removed Steering Vectors

research

Understanding LLM Reasoning Through Meaning-Removed Steering Vectors