mediocreatbest@lemmy.sdf.orgM to mediocreatbest@lemmy.sdf.orgEnglish · 1 year agoTaming AI Bots: Prevent LLMs from entering "bad" states using continuous guidance from the LLM ("is this good? bad?") to avoid bad states.arxiv.orgexternal-linkmessage-square0fedilinkarrow-up11arrow-down10cross-posted to: math@lemmy.sdf.org
arrow-up11arrow-down1external-linkTaming AI Bots: Prevent LLMs from entering "bad" states using continuous guidance from the LLM ("is this good? bad?") to avoid bad states.arxiv.orgmediocreatbest@lemmy.sdf.orgM to mediocreatbest@lemmy.sdf.orgEnglish · 1 year agomessage-square0fedilinkcross-posted to: math@lemmy.sdf.org