Google.cn is a Chinese language search service targeted towards users in the People’s Republic of China. It was launched on January 25 2006 and it filters search requests to content deemed to be “sensitive” by the government of China. (You can compare search results between the uncensored Chinese language Google.com and the censored Google.cn using the OpenNet Initiative’s Search Comparison tool.)
The filtering takes place in at least three ways:
- de-listed domains: specific websites are removed entirely from search results; it is as if the website never existed.
- de-listed urls: specific urls are removed from search results if they contain a de-listed domain.
- restricted keywords: specific keywords are restricted to searches of web pages hosted in China only.
The New York Times reports that the Chinese government did not give Google a list of sites to block. Rather, Google set-up a computer in China and tested to see what content was accessible and content found to be inaccessible was deemed to be sensitive and added to Google’s blocklist.
For example, the website for Human Rights Watch (hrw.org), which is blocked in China, has also been de-listed from Google.cn. A normal web request to hrw.org from within China triggers an error and the content of the site never loads in a users browser. A search in Google using the modifier “site:” for content on hrw.org (site:hrw.org) on Google.cn yields no results. In China, it is as if hrw.org does not exist.
Is there a way to circumvent Google’s censorship in China?
Google has an advertisement program, Google Adsense/Adwords, that allows one to purchase certain keywords that will display an ad on Google when users search for those words. I created an account with Google Ads and selected that my ad be shown in Chinese to users in China. I noticed that a warning appeared indicating that there may be restrictions on advertising in China.
Google describes some specific categories of content that require licensing (local pdf). This list does not include content that may be sensitive for political reasons.
Due to advertising regulations and laws of the People’s Republic of China, Google AdWords requires advertisers to submit business licenses and approval certificates for the following product categories: Agricultural Chemicals Books/Periodicals Cosmetics Food/Foodstuffs Health Supplements Medical Appliances Medical Services Patents Real Estate Veterinary Medicine
I created my ad (which does not appear to fall under these categories) for hrw.org, which is censored by google.cn, and it was held in a queue waiting to be viewed and labeled “Family Safe”. Only “Family Safe” ads are allowed to be shown by Google in China. Eventually my ad was approved as “Family Safe” and was labeled as currently being shown.
However, my ad was initially not shown on Google.cn.
Google indicates that there is an ad for the search terms I selected, but it is not shown. I emailed Google for an explanation of why my ad was not being shown and was informed that there may be a technical error.
My ad was being shown on the uncensored Chinese language Google, but not the censored Google.cn. Google checks what ads to deliver by location (determined by IP address) and the language setting of your browser. Despite both of these showing that my language was Chinese and my location was in China the ad did not properly appear.
Eventually, my ad began to be shown on Google.cn. While my ad does not appear every time the keywords are searched, it does periodically appear. (See possible explanation below).
Although there are no search results available for hrw.org, my ad for a censored website did appear on some occasions. (See below for a possible explanation.)
This is a neat way to circumvent Google’s censorship. It may be possible to extend this even further. Mirror sites and alternative URLs for censored web sites can be displayed through the use of Google Ads.
For the techies out there…
www.google.cn resolves to several IP addresses:
$ host www.google.cn
www.google.cn is an alias for cn.l.google.com.
cn.l.google.com has address 220.127.116.11
cn.l.google.com has address 18.104.22.168
cn.l.google.com has address 22.214.171.124
cn.l.google.com has address 126.96.36.199
cn.l.google.com has address 188.8.131.52
cn.l.google.com has address 184.108.40.206
When connecting directly to these IP addresses and manually inserting the Host header (Host: www.google.cn) the normal google.cn appears (with filtered results), but the ad is missing.
When connecting directly to alternate Google IP addresses, such as 220.127.116.11 and 18.104.22.168 and manually inserting the Host header (Host: www.google.cn) the normal google.cn appears (with filtered results), the ad does appear.
It appears that some specific IP addresses assigned to serve www.google.cn do not consistently show the ad, while other Google IP addresses do consistently show the ad.
I am still waiting for an explanation from Google.