Artificial Intelligence Alignment
https://intelligence.org/get-involved/
https://www.alignmentforum.org/posts/PqMT9zGrNsGJNfiFR/alignment-research-field-guide
https://equilibriabook.com/toc/
https://arxiv.org/abs/1202.6153
https://www.lesswrong.com/s/Rm6oQRJJmhGCcLvxh
https://www.lesswrong.com/s/4dHMdK5TLN6xcqtyc
To understand (some of) the existing approaches and jargon, I’d recommend at least skimming these sequences and posts, then diving deeper into whichever most resemble the directions you want to pursue:
- Embedded Agency
- Value Learning
- 11 Proposals For Building Safe Advanced AI
- Risks From Learned Optimization