Here is my attempt to archive r/homelab before it went dark. Google says there about 92,400 results for site:reddit.com/r/homelab, I have 2098, that’s only about 2% of it. Maybe there is something of use to you in that 2%.
Please don’t webscrape, if you want all the data you can get the raw BDFR archive at https://archive.douwes.co.uk/reddit/homelab.tar or the live web version at https://archive.douwes.co.uk/reddit/homelab-web.tar
I don’t know if self hosting is good for this purpose. What if something happens to the owner of the server or to the server itself? I think it is better to use something like lemmy.world, lm or other instances of Lemmy