

No there’s some ideas out there. Concepts like heirarchical reinforcement learning are more likely to lead to AGI with creation of foundational policies, problem is as it stands, it’s a really difficult technique to use so it isn’t used often. And LLMs have sucked all the research dollars out of any other ideas.
I’ve had the same issue with episodes generally. I have a python script that runs on a regular basis that just updates the episode number based on the premier date.