whitehouse robots
Posted 2003-11-12 @ 16:29:34
Todd Dominey notes that whitehouse.gov has quite a restrictive robots.txt file. Wondering what's left?
ksmith$ wget -r -l inf -nv http://whitehouse.gov
16:03:15 URL:http://www.whitehouse.gov/ [36587/36587]
-> "www.whitehouse.gov/index.html" [1]
FINISHED --16:03:15--
Downloaded: 36,587 bytes in 1 files
The home page. That's it.
©2002-2008 kevin c smith