12 Research Sites

Class 12- Research Sites

Focus on Academic research sites, basically sites to help you cite.

So far we have been focusing on the surface web, we've been connected to a bunch of html pages (they're available right there at the "Surface"). Today we will talk about the "hidden web" or the "deep web". The idea is that there are query forms and behind that there are documents.

For Example: You go to the library and search by a query form (Title, Author) so instead of a page for each book, they have a form that gets you to each books. 60 largest deep website contain approximatley 750 terrabytes of data (BIG) "Is it enough to care about… YES" Estimated to be about 300,000 query databases out there.

Why do we care?? Does Google Go to it? NOPE. So we are ignoring vast subsets of data that we could use. So we need to use specialized search engines.

Specialized search engines have three important attributes 3

1. Comprehensive - When you go to a deep web search engine that it gets enough of (you'd love all) journals or websites you would like it to reach.
2. Integrated - Think about all the different kinds of forms out there they you can fill out and get data from, you'd like to be able to type something into some form and the search engine can use that to fill out all the different variants of forms (difficult). Integrated to work with all different forms equally well.
3. Transparent - When you go to the deep web search engine you want to know what you are search engine is searching (what journals its searching etc.) Google Scholar doesn't tell you

Looking at 4 Different Kind of Sites

1. Deep Web Search tools
a. General
b. Specialized (ex. Science, business, medicine)
2. Library and Book Based Sites
3. Web Directories - a evaluated and vetted set of resources that are categorized (someone has already deemed sites to be useful)
4. General References and Answers

Examples:

Google Scholar

  • When first came out five years everyone thought it would be a great academic search tool
  • Looks different (Articles, Authors, Were it comes from, Cited by)
  • You can setup so it says when these documents are available at your library
  • Can limit searches to certain date ranges and just get summaries or citations
  • Used to be some real problems but it's been fixed
    • Example: Search for A = x Search for The = Y Search for A or the = only x (Fixed though)
    • Last years theme was that Google Scholar was the worst piece of software ever, fingers were threatened to be broken. A day's worth of work has made big changes
    • Can search intitle: or allintitle: but also can search only within subject areas (Can't do in Google itself)
      • Some documents aren't categorized and will lose them if you specify but subjects (even if you select all subjects!)
  • Can specify certain dates
  • Use it as a nice place to start, but be careful

Scirus

  • Nice tool "Works like a champ"
  • Doesn't cover as much as Google Scholar
  • Limit to certain Journal Sources, Date, Refinement

Google Books

  • If anyone ever did a report on Google Books would keep you busy on a day by day basis
  • Huge story because Google thought it was right and good to take out of copyright and scanning and put on web (easier access). Some people saw this as infringing on authors rights and evil (Contradictory to Googles "Do no Evil" policy)
  • Searches give you results about whole books, reports, and authors

Search Tool at U of M Library

  • Will in real time search through multiple databases (can also use tags to get into data)

Biznar

  • Business deep web site (specialized)
  • Information should be pretty reliable (and filterable)
  • Same thing at Bnet

WolframAlpha

  • Not a search engine, but it attempts to answer the question it thinks your are trying to ask

Yahoo Directory

  • Can go into Business Economy
    • Example you search Airline Industry and it will return to you directory categories
Unless otherwise stated, the content of this page is licensed under Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License