As part of “Book on Book” will be information on Physical Libraries and Digital Libraries with internet crawling and data capture for achiving purposes an important part of the informational cataloguing area.
https://www.loc.gov/preservation/digital/formats/intro/intro.shtml
You can view a huge junk of internet that has be crawled and extracted on this site below but getting at it seems a bit technical.
This link below is a paper that has been written based upon the use of the Common Crawl Corpus data looking where geospatial information is being used on the internet.
No comments:
Post a Comment