Posts by Tags

interpretability

Understanding LLM Reasoning Through Meaning-Removed Steering Vectors

3 minute read

Published:

Large Language Models (LLMs) have shown remarkable capabilities in reasoning tasks, but understanding and controlling their internal reasoning processes remains a significant challenge. In my ongoing research at the MINE Lab (University of Notre Dame), I’m working on a novel approach to this problem: meaning-removed steering vectors.

llm reasoning

Understanding LLM Reasoning Through Meaning-Removed Steering Vectors

3 minute read

Published:

Large Language Models (LLMs) have shown remarkable capabilities in reasoning tasks, but understanding and controlling their internal reasoning processes remains a significant challenge. In my ongoing research at the MINE Lab (University of Notre Dame), I’m working on a novel approach to this problem: meaning-removed steering vectors.

machine learning

Understanding LLM Reasoning Through Meaning-Removed Steering Vectors

3 minute read

Published:

Large Language Models (LLMs) have shown remarkable capabilities in reasoning tasks, but understanding and controlling their internal reasoning processes remains a significant challenge. In my ongoing research at the MINE Lab (University of Notre Dame), I’m working on a novel approach to this problem: meaning-removed steering vectors.

research

Understanding LLM Reasoning Through Meaning-Removed Steering Vectors

3 minute read

Published:

Large Language Models (LLMs) have shown remarkable capabilities in reasoning tasks, but understanding and controlling their internal reasoning processes remains a significant challenge. In my ongoing research at the MINE Lab (University of Notre Dame), I’m working on a novel approach to this problem: meaning-removed steering vectors.