At first we thought the list we found covered every web site on the Internet, but this site itself, Newsi8.com, is not on it. What gives? Lol. When we say you’ve found something super unknown by visiting this web site, we mean it!
Below is a link to the list of the top 1 million web sites on the Internet (15 MB CSV file) as of February 19, 2024:
Top_1000000-2024-02-19 (right click to download)
We believe the few major search engines are hiding what you may want to find, including answers to many questions vital for your health and happiness, with algorithms designed to show you only search results that make them money in some way.
Become A Search Engine
One idea behind presenting this huge uncensored list is that, if you could write your own web crawler, you could get around all of the problems that a certain giant search engine has been causing with its indirect censorship over recent years. Of course, even then you would not crawl cool lesser-known sites like Newsi8.com, since they are not on the list.
Obstacles to Becoming A Search Engine
The problem with this idea is that Big Tech really does not want to give up their stronghold. Therefore, they have gatekeepers like Cloudflare that watch all Internet activity and block sites that seem to be horning in on their monopoly. It can still be done, but, for example, you would not want to start scanning 100 URLs in alphabetical order, or your IP address would quickly be put on a block list as containing malware. The ability to broadly define malware as anything that tries to upset your search monopoly is one of the great powers of Big Tech at this time in history. Yes, as you can probably guess, this feels pretty disgusting, illegal and unethical to me. But that’s the way it is.
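For anyone who wants to experiment, here is a minimal sketch of visiting sites in shuffled order with a pause between requests, instead of scanning them alphabetically. The function names crawl_order and fetch_slowly, the 5-second delay and the user-agent string are all made up for illustration; they are assumptions, not part of any existing tool.

```python
import random
import time
import urllib.error
import urllib.request


def crawl_order(domains, seed=None):
    """Return a shuffled copy of the list so that consecutive requests
    are not alphabetical neighbours. The input list is left untouched."""
    shuffled = list(domains)
    random.Random(seed).shuffle(shuffled)
    return shuffled


def fetch_slowly(domains, delay_seconds=5.0, user_agent="example-crawler/0.1"):
    """Yield (domain, status or exception) pairs, one request at a time.

    delay_seconds and user_agent are illustrative defaults (assumptions).
    """
    for domain in crawl_order(domains):
        request = urllib.request.Request(
            f"http://{domain}/", headers={"User-Agent": user_agent}
        )
        try:
            with urllib.request.urlopen(request, timeout=10) as response:
                yield domain, response.status
        except (urllib.error.URLError, OSError) as exc:
            # HTTP errors, DNS failures and timeouts all end up here.
            yield domain, exc
        time.sleep(delay_seconds)
```

A real crawler would probably also want to read each site's robots.txt (Python ships urllib.robotparser for that) and keep the request rate low, so treat this only as a starting point.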
Disclaimer
Beware, of course. The Top 1 Million list contains some of almost everything. There are definitely sites that may try to give you viruses, as well as gambling, adult entertainment, strange portals, and who knows what else. If you are under 18, you have plenty of other things to be doing for your future. Do not waste time or risk getting in trouble exploring the semi-hidden Internet.
Precautions
Do not view any of these sites without at least the following precautions:
1. Make a backup of your entire operating system and test it so you are certain you can restore EVERYTHING from scratch if needed.
2. Use a VPN to protect your privacy.
3. Use a plugin like NoScript to block scripts that may cause problems. Enable scripts selectively and carefully, as needed, because many sites are completely broken without JavaScript.
4. Block pop-ups.
If a site tries to get you to install a plug-in for your browser, close the page or the entire browser and start again.
More About The List
The actual number of sites in the list is 1,000,263, and they are in alphabetical order. #1 is 0-0-0.com (5 Dec 1998 – 31 May 2023, started as an adult site according to archive.org). #1000263 is zzzzzl.me, which redirects to http://www.zzzzzl.me, for which the server is not found. The Wayback Machine at archive.org has not archived zzzzzl.me.
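If you want to check the count and the first and last entries yourself, something like the sketch below should do it. It assumes the CSV has no header row and that the domain name is the first field on each line; we have not documented the file's layout here, so adjust the column index if yours differs. The names load_domains and summarize are just illustrative.

```python
import csv


def load_domains(lines):
    """Read domains from an iterable of CSV text lines (e.g. an open file).

    Assumption: no header row, and the domain is the first field of each row.
    Blank lines are skipped.
    """
    domains = []
    for row in csv.reader(lines):
        if row and row[0].strip():
            domains.append(row[0].strip().lower())
    return domains


def summarize(domains):
    """Return the number of entries plus the first and last one."""
    return {
        "count": len(domains),
        "first": domains[0] if domains else None,
        "last": domains[-1] if domains else None,
    }


# Example use (the file name and extension here are a guess at what
# the download is saved as):
#
#   with open("Top_1000000-2024-02-19.csv", newline="", encoding="utf-8") as f:
#       print(summarize(load_domains(f)))
```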
Broken Junk
Many of the pages on this big list are missing (status code 404 Not Found), broken (error), inaccessible (status code 403 Forbidden), or are redirectors to other web sites. Therefore, if we can find a way, we may try to weed it down to just the ones that give an “OK” status code (200). The site can still be a blank page in that case, so it may not be much of an improvement, but it would be a start.
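Here is one rough way the weeding could work, sketched under some assumptions: each site is asked for its home page over HTTPS with a HEAD request, and only a plain 200 counts as “OK”. The names classify_status, head_status and keep_ok are hypothetical, and a site that only answers on plain HTTP would be counted as an error by this version.

```python
import http.client


def classify_status(status):
    """Map an HTTP status code (or None for a failed connection) to a label."""
    if status is None:
        return "error"
    if status == 200:
        return "ok"
    if 300 <= status < 400:
        return "redirect"
    if status == 403:
        return "forbidden"
    if status == 404:
        return "missing"
    return "other"


def head_status(domain, timeout=10):
    """Send one HEAD request for "/" and return the status code.

    http.client does not follow redirects, so a 301 or 302 is reported as is.
    Returns None if the connection or the request fails.
    """
    conn = http.client.HTTPSConnection(domain, timeout=timeout)
    try:
        conn.request("HEAD", "/")
        return conn.getresponse().status
    except (OSError, http.client.HTTPException):
        return None
    finally:
        conn.close()


def keep_ok(domains, status_lookup=head_status):
    """Return only the domains whose status is classified as "ok"."""
    return [d for d in domains if classify_status(status_lookup(d)) == "ok"]
```

Passing a different status_lookup function makes it easy to try the filter on saved results without touching the network.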