Ideas
- Improving "LLM as a judge" schemes by ensuring minimal semantic correlation between evaluation dimensions
- "LLMs don't want to do work" - test this by having an LLM rate something designed to be neutral, with the stipulation that whenever it decides to give a bad rating it must write a longer explanation; does the extra effort shift ratings upward?
	- Would first have to research LLM biases in rating tasks in general.
- Align a model to ask questions more often: "ask and you shall receive"
- "Entering the flow state with LLMsā, use the sparse autoencoders idea where you can add a vector at the last layer and find out how much that improves LLM output
- For robot arms with cameras mounted directly on the grippers, how can we introduce behaviour to "take a step back and look at the bigger picture" before going back in?
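A minimal sketch of the first idea above: given a judge's per-item scores on two candidate evaluation dimensions, a high Pearson correlation suggests the dimensions are semantically redundant and one could be dropped or redefined. All scores, dimension names, and the threshold here are made up for illustration.

```python
import math

def pearson(xs, ys):
    """Pearson correlation between two equal-length score lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

# Hypothetical judge scores for six items on three dimensions.
helpfulness = [4, 5, 3, 2, 5, 4]
clarity     = [4, 5, 3, 2, 4, 4]   # tracks helpfulness closely -> likely redundant
depth       = [2, 5, 4, 1, 3, 5]

r = pearson(helpfulness, clarity)
if abs(r) > 0.8:  # arbitrary redundancy threshold for illustration
    print(f"dimensions look redundant (r={r:.2f})")
```

A real pipeline would compute this over many rated items and prune or merge any dimension pair above the chosen threshold before trusting the judge's scores as independent signals.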
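And a toy sketch of the steering idea: add a scaled feature direction (e.g. one found by a sparse autoencoder) to a last-layer hidden state and observe how the output changes. The vectors and the alpha scale are purely illustrative; a real version would hook into the model's residual stream with actual tensors.

```python
def steer(hidden, direction, alpha):
    """Return hidden + alpha * direction (plain lists standing in for tensors)."""
    return [h + alpha * d for h, d in zip(hidden, direction)]

hidden_state   = [1.0, 2.0, 3.0]      # hypothetical last-layer activation
flow_direction = [0.5, 0.0, -0.5]     # hypothetical "flow state" feature direction

steered = steer(hidden_state, flow_direction, alpha=2.0)
# steered == [2.0, 2.0, 2.0]
```

Sweeping alpha and scoring the resulting generations would give the "how much does this improve output" measurement the note asks for.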