I think this lack of 'G' (generality, or modality) is the problem. A human visualizes this kind of problem (a little video plays in my head of taking a car to a car wash). LLM's don't do this, they 'think' only in text, not visually.
A proper AGI would have have to have knowledge in video, image, audio and text domains to work properly.
Most of the USA's refineries specialize in low grade oil. The best grade oil is often shipped out of the USA for refining. Shipping costs are so low on a grand scale that it's more profitable to ship the USA's high quality oil overseas than building new refineries in the USA just for that:
https://www.marketplace.org/story/2024/05/13/the-u-s-exports...
Using 'copy' as a clipboard script tells me OP never lived through the DOS era I guess... Used to drive me mad switching between 'cp' in UNIX and 'copy' in DOS.
(Same with the whole slash vs backslash mess.)
I believe this is why all modern digital watches use a 32768.0Hz crystal resonator, it's a power-of-2 frequency above the 20Khz top end of the range of human audio perception, to avoid the whole 'tinnitus on your wrist' thing.
I believe youtube still uses 40 mel-scale vectors as feature data, whisper uses 80 (which provides finer spectral detail but is computationally more intensive to process naturally, but modern hardware allows for that)
that's not true, consumerism is only growing, people are not giving up anything in that regard.
The planet is getting trashed and 'the children' are doomed.
Individually We try and help, driving less, recycling and so on, but it kinda gets diluted by a billion Chinese moving into the middle-class and burning coal like there's no tomorrow.
A proper AGI would have have to have knowledge in video, image, audio and text domains to work properly.
reply