Yesterday I deleted whole #FediSearch index and started crawling the #fediverse from scratch.
So many new accounts should be discovereable now.
This entry was edited (2 years ago)
Yesterday I deleted whole #FediSearch index and started crawling the #fediverse from scratch.
So many new accounts should be discovereable now.
Štěpán Škorpil
in reply to Štěpán Škorpil • • •Štěpán Škorpil
in reply to Štěpán Škorpil • • •For now I added the badly configured instance to the blacklist.
Štěpán Škorpil
in reply to Štěpán Škorpil • • •Štěpán Škorpil
in reply to Štěpán Škorpil • • •NoLog.cz 🏴
in reply to Štěpán Škorpil • • •I'm not sure how it works on pre-4.0, but now the user directory is limited to 80 records per page and it can't be overridden with the 'limit' argument.
Štěpán Škorpil
in reply to NoLog.cz 🏴 • • •docs.joinmastodon.org/methods/…
directory API methods
docs.joinmastodon.orgNoLog.cz 🏴
in reply to Štěpán Škorpil • • •If I understand it correctly, it sets the limit to 500 users per page, but the server has it's internal limit on 80.
So if the instance has < 500 users, it only shows the first 80 and with >500 it probably undercounts by a lot if the same limit is everywhere.
github.com/Stopka/fedicrawl/bl…
fedicrawl/retrieveLocalPublicUsersPage.ts at 29acce39063d1dbfbe69bab22348855ff5ca21c2 · Stopka/fedicrawl
GitHubŠtěpán Škorpil
in reply to NoLog.cz 🏴 • • •Štěpán Škorpil
in reply to Štěpán Škorpil • • •NoLog.cz 🏴
in reply to Štěpán Škorpil • • •stop genocide in gaza
in reply to Štěpán Škorpil • • •Does this respect robots.txt and opt-outs, and limit itself to profiles?
Would be nice to have a statement about this on the site.
Laskuvirhe inventaariossa
in reply to stop genocide in gaza • • •@nikodemus I've opted out from search engine indexing and I could still find my profile.
Do. Not. Like. This.
Štěpán Škorpil
Unknown parent • • •Štěpán Škorpil
Unknown parent • • •Štěpán Škorpil
in reply to Štěpán Škorpil • • •Štěpán Škorpil
in reply to Štěpán Škorpil • • •