Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> A modern sparse Transformer, for instance, is not "conscious," but it is an excellent engineering approximation of two core brain functions: the Global Workspace (via self-attention) and Dynamic Sparsity (via MoE).

Could you suggest some literature supporting this claim? Went through your blog post but couldn't find any.



Sorry, I didn't have time to find the relevant references at the time, so I'm attaching some now

https://www.frontiersin.org/journals/computational-neuroscie...

https://arxiv.org/abs/2305.15775




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: