-
Notifications
You must be signed in to change notification settings - Fork 55
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Block internet search engines from indexing the mirror #48
Comments
The benefit is entirely for clearnet users. Tor users, for example, will (almost) always be able to access Wikipedia over tor so they'll see little benefit. We should probably add a |
If anonymity is the concern, then accessing IPFS through the
Doing so would have the intended effect of removing the mirror from Google search results, and it is actually the preferred way to implement this. |
Ah, I think the confusion may be around the definition of "clearnet". IPFS is a clearnet. That is, it's not a darknet (it provides no anonymity at the moment). Darknets get no benefit because the exit nodes tend to be in countries with strong free speech laws.
Unlikely. We don't have any IPFS search mechanisms and rely entirely on web search engines. That's probably one of the reasons we don't use |
+1 for setting rel='canonical' links. I'm starting to see the mirror pop up frequently on the first page of Google results just from normal everyday use. Canonical links should avoid this duplication and make the mirror a good web citizen. |
The lack of canonical tag comes from the htmls generated by kiwix's mwoffiler. I opened an issue openzim/mwoffliner#564 |
I understand, but you can also add a canonical link in the webserver response headers. |
Context: ipfs/distributed-wikipedia-mirror#48 License: MIT Signed-off-by: Marcin Rataj <lidel@lidel.org>
Context: ipfs/distributed-wikipedia-mirror#48 License: MIT Signed-off-by: Marcin Rataj <lidel@lidel.org>
I fixed this upstream (openzim/mwoffliner#963) 👌 Remaining steps before this issue can be closed:
OR:
I will be checking on mwoffliner/kiwix situation, but if someone has spare bandwidth and can to speed things up, please contribute upstream & post updates here. |
If possible, are you able to make your mirror non-indexed by internet search engines? There is very minimal benefit for clearnet users to run across three (WMF, WikiVisually and ipfs) different copies of the Wikipedia article every time they search for something.
The text was updated successfully, but these errors were encountered: