Skip to content
View likenneth's full-sized avatar

Highlights

  • Pro

Block or report likenneth

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. othello_world othello_world Public

    Emergent world representations: Exploring a sequence model trained on a synthetic task

    Jupyter Notebook 168 40

  2. honest_llama honest_llama Public

    Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

    Python 464 37

  3. persona_drift persona_drift Public

    Measuring and Controlling Persona Drift in Language Model Dialogs

    Python 12 3

  4. q_probe q_probe Public

    Q-Probe: A Lightweight Approach to Reward Maximization for Language Models

    Jupyter Notebook 37 1

  5. dialogue_action_token dialogue_action_token Public

    Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner

    Python 15 1