If you want to truly understand the way the world works, you may want to have a way to model the objects, events, interactions, and all their semantic connections, as they change over time.
But that's too hard.
Let's just have a universal parrot that has been trained to retain and repeat everything it has ever heard. Sooner or later, the parrot will sound like it understands things, without having to deal with that other messy stuff. All it needs is more examples to mimic.