Ah, I don't know much about multi modal models but I wonder what they'd think of...

noddybear · 2025-03-11T16:31:21 1741710681

The thing is that when you create a dense ASCII representation, any gain you might make from the spatial relationships is lost by: a) the tokeniser not working on characters alone (remember strawberrry), and b) the increased number of 'dead' tokens encoding not very much.

Our sparse encoding seems to confuse the models less - even though it certainly isn't perfect.

kridsdale1 · 2025-03-11T15:56:12 1741708572

I mean at some point you compress the board state down to Dwarf Fortress with an extended ASCII representation for each grid-state (maybe 2 bytes each?)

vessenes · 2025-03-11T16:38:29 1741711109

Lots of questions here - you need item, orientation, info about pipes (2 directions) , belts (3 or 4 colors x2 directions). Do you wish Circuits?