Look, the naming ship has sailed and sunk somewhere in the middle of the ocean. I think it’s time to accept that “AI” just means “generative model” and what we would have called “AI” is now more narrowly “AGI”.
People call videogame enemies “AI”, too, and it’s not the end of the world, it’s just imprecise.
Yeah, on that I’m gonna say it’s unnecessary. I don’t know what “integration with the desktop” gets you that you can’t get from having a web app open or a separate window open. If you need some multimodal goodness you can just take a screenshot and paste it in.
I’d be more concerned about model performance and having a well integrated multimodal assistant that can do image generation, image analysis and text all at once. We have individual models but nothing like that that is open and free, that I know of.