

A very similar situation was analysed in this recently published paper. The quality of what is generated degrades significantly.
However, they mostly investigate replacing the data with AI-generated data at each step, so I doubt the effect will be as pronounced in practice. Human writing will still be included, and even human curation of AI-generated text can skew the distribution of the training data (as the editors' selection process inevitably would, since reasonable text could still slip through the cracks).
It is already up and running; you can see posts from various government agencies at https://social.overheid.nl/public/local
Thunderbird has RSS integrated, which could be quite neat once that synchronizes.