Course Schedule:
Week 1 - (Jan. 18, 2007) Course Overview and Introduction
Primary Readings
- Primary Readings are required for class discussion.
Secondary Readings
- Secondary Readings are supplemental materials related to the topic.
Deliverables
- Personal Introductions
- Syllabus Overview
Week 2 - (Jan. 25, 2007) The Search
Primary Readings
- The Search: How Google and Its Rivals Rewrote the Rules of Business and Transformed Our Culture by John Battelle. Chapters 1 - 4
Secondary Readings
Deliverables
- Class Work: Discuss the Database of Intentions
- Review Paper: The Search (1 page)
Week 3 - (Feb. 1, 2007) The World of Google
Primary Readings
- The Search: How Google and Its Rivals Rewrote the Rules of Business and Transformed Our Culture by John Battelle. Chapters 5 - 11 and Epilogue
Secondary Readings
- http://www.jux2.com/ (compare search results)
- GigaBlast (note their government search)
- Relona interactive search
- Brint.com (business technology, economy search)
Deliverables
- Review Paper: The Search (1 page)
Week 4 - (Feb. 8, 2007) Using and Evaluating Search Engines
Primary Readings
- Google: The Missing Manual, Second Edition By Sarah Milstein, Rael Dornfest, J.D. Biersdorfer, Matthew MacDonald. Chapters 1 - 2.
- Krug, Steve. "How do we really use the web?" from Don't Make Me Think! A Common Sense Approach to Web Usability. 2000. QUE.
- Jansen, B.J. and Pooch, U. (2001). A Review of Web Searching Studies and a Framework for Future Research. Journal of the American Society for Information Science and Technology, 52(3). 235-246.
Secondary Readings
- Choo, C. W., Detlor, B., & Turnbull, D. (2000). Information Seeking on the Web: An Integrated Model of Browsing and Searching. First Monday, 5(2).
- Morrison, J. B., Pirolli, P., & Card, S. K. (2001). A Taxonomic Analysis of What World Wide Web Activities Significantly Impact People's Decisions and Actions. Proceedings of CHI 2001, Seattle, WA.
- Bates, Marcia J. (1989). The Design Of Browsing And Berrypicking Techniques For The Online Search Interface. Online Review, 13(5), 407-431.
- Navarro-Prieto, R., M. Scaife, et al. (1999). Cognitive Strategies In Web Searching. Human Factors and the Web Conference, Gaithersburg, Maryland.
- Kelly, Diane & Belkin, Nicholas J. (2001) Reading Time, Scrolling and Interaction: Exploring Implicit Sources of User Preferences for Relevance Feedback. Proceedings of SIG. New Orleans, LA. pp 408-409.
Deliverables
- Class Work: Using Search Engines
- Search Site Review Selection (for presentation next week)
- Initial Project or Paper topics discussion
Week 5 - (Feb. 15, 2007) Foundations of Web Information Retrieval
Primary Readings
- Brin, S. and Page, L. (1998) The anatomy of a large-scale hypertextual web search engine. World Wide Web 7 Conference.
- Hearst, M. (2000). Next Generation Web Search: Setting Our Sites. Bulletin of the IEEE Computer Society Technical Committee on Data Engineering.
- Woodruff, Allison; Aoki, Paul M.; Brewer, Eric; Gauthier, Paul & Rowe, Lawrence A. (1996) An Investigation of Documents from the World Wide Web. Computer Networks and ISDN Systems. Vol 28, Nos. 7-11, pp. 963-980.
- Ivory, Melody Y & Hearst, Marti A. (2002) Statistical Properties of Highly-Rated Web Sites. Proceedings of ACM SIGCHI. pp 1-8.
Secondary Readings
- Kobayashi, M., & Takeda, K. (2000). Information Retrieval on the Web. ACM Computing Surveys, 32(2).
- Bazac, D. (2002) The Meta Search Engines: A Web Searcher's Best Friends. October 10, 2002 from evolt.org.
- Thurow, S. (2002) Search Engine Visibility. New Riders. (Bonus Materials!)
- Spinellis, D. (2003) The Decay and Failures of Web References. Communications of the ACM, 46(1):71-77.
Deliverables
- Class Work: Evaluating Search Engines
- Search Site Presentation
- Project or Paper topics discussion
Week 6 - (Feb. 22, 2007) Beyond Text - Searching for Images, Audio & Video
Primary Readings
- Google: The Missing Manual, Second Edition By Sarah Milstein, Rael Dornfest, J.D. Biersdorfer, Matthew MacDonald. Chapter 3.
- Williamson, C. & Shneiderman, B. (1992) The dynamic HomeFinder: Evaluating dynamic queries in a real-estate information exploration system. In Proceedings of the fifteenth annual international ACM/SIGIR conference on research and development in information retrieval. pp. 338-346.
Secondary Readings
- Clusty
- Li, Mingzhe; Claypool, Mark; Kinicki, Robert & Nichols, James (2003) Characteristics of Streaming Media Stored on the Internet. Technical Report #WPI-CS-TR-03-18. CS Department, Worcester Polytechnic Institute.
Deliverables
- Topic Review
Week 7 - (Mar. 1, 2007) Informetrics, Webometrics and Web Use Metrics
Primary Readings
- Larson, Ray (1996). Bibliometrics of the World Wide Web: An Exploratory Analysis of the Intellectual Structure of Cyberspace. Paper presented at the 59th ASIS Annual Meeting, Baltimore, Maryland.
- Kleinberg, Jon. (1998) Authoritative Sources in a Hyperlinked Environment. Journal of the ACM, Vol. 46, No. 5. pp 604-632.
- Turnbull, Don. (1996) Bibliometrics and the Web. Technical Report FIS-12-19-1996-1, University of Toronto.
- Spertus, E. (1997, April 7-11). ParaSite: Mining Structural Information on the Web. Paper presented at the Sixth International World Wide Web Conference, Santa Clara, CA.
Secondary Readings
- Rousseau, R. (1997). Sitations: an exploratory study. Cybermetrics: International Journal of Scientometrics, Informetrics and Bibliometrics, 1(1), 1-7.
- Almind, T. C., & Ingwersen, P. (1997). Informetric Analyses on the World Wide Web: Methodological Approaches to 'Webometrics'. Journal of Documentation, 53(4), 404-426.
- Ingwersen, P. (1998). The Calculation of Web Impact Factors. Journal of Documentation, 54(2), 236-243.
Deliverables
- Topic Review
Week 8 - (Mar. 8, 2007) Web Analytics
Primary Readings
- Google: The Missing Manual, Second Edition By Sarah Milstein, Rael Dornfest, J.D. Biersdorfer, Matthew MacDonald. Chapter 10.
- Extended Log File Format - W3C Working Draft WD-logfile-960323
- Haigh, Susan and Megarity, Janette. (1998). Measuring Web Site Usage: Log File Analysis. Network Notes #57. ISSN 1201-4338. Information Technology Services Report, National Library of Canada.
- Silverstein, C., Henzinger, M., Marais, H., & Moricz, M. (1998). Analysis of a Very Large AltaVista Query Log. Technical Report 1998-014, Digital Systems Research Center. October 26, 1998.
- Jansen, B. J., A. Spink, et al. (1998). Real Life Information Retrieval: A Study of User Queries on the Web. SIGIR Forum: A Publication of the Special Interest Group on Information Retrieval. 32: 5-18.
- Pitkow, J. E. (1997). In Search of Reliable Usage Data on the WWW. Sixth International World Wide Web Conference, Santa Clara, CA.
Secondary Readings
- WebTrends Log Analyzer Series Datasheet
- Cooley, R. W., B. Mobasher, et al. (1999). "Data Preparation for Mining World Wide Web Browsing Patterns." Knowledge and Information Systems.
- Fayyad, U., G. Piatetsky-Shapiro, et al. (1996). "The KDD Process for Extracting Useful Knowledge from Volumes of Data." Communications of the ACM 39(11): 27-34.
- Calore, Michael. (2001). Log File Lowdown. Lycos-WIRED Webmonkey
- Webmonkey Staff (2000-2001) eBusiness: Tracking. Lycos-WIRED Webmonkey.
- Wu, K.-l., P. S. Yu, et al. (1998). "Speedtracer: A web usage mining and analysis tool." IBM Systems Journal 37(1): 89-105.
- Recker, M. R. and J. E. Pitkow (1996). Predicting Document Access in Large, Multimedia Repositories. ACM Transactions on Computer-Human Interaction. 3(4), pp 352-375.
- Nielsen, Jakob. (1998). Tracking the Growth of a Site. Alertbox Report.
Deliverables
- Topic Review
- Class Work: Web Analytics Assignment Discussion
Week 9 - (Mar. 15, 2007) Spring Break - No Scheduled Class
Week 10 - (Mar. 22, 2007) No Class - IA Summit
Week 11 - (Mar. 29, 2007) Search Engine Optimization & Marketing
Primary Readings
- Google: The Missing Manual, Second Edition By Sarah Milstein, Rael Dornfest, J.D. Biersdorfer, Matthew MacDonald. Chapter 9.
- Rosenfeld, L., Wiggins, R. Using search analytics to diagnose what's ailing your IA | Slideshow
Secondary Readings
Deliverables
- Topic Review
- Web Analytics Reports Due
Week 12 - (Apr. 5, 2007) Web Advertising
Primary Readings
- Google: The Missing Manual, Second Edition By Sarah Milstein, Rael Dornfest, J.D. Biersdorfer, Matthew MacDonald. Chapter 9.
Secondary Readings
Deliverables
- Topic Review
Week 13 - (Apr. 12, 2007) Information Filtering
Primary Readings
- Belkin, N. J. and W. B. Croft (1992). Information Filtering and Information Retrieval: Two Sides of the Same Coin? Communications of the ACM 35(12): 29-38.
- Resnick, P. and Varian, H. (1997). Recommender Systems. Communications of the ACM 40(3).
- Turnbull, Don and Efron, Miles (2006) OpenChoice: A Platform for Web Content Classification & Filtering. Workshop paper for the Open Source Workshop at the 15th International World Wide Web Conference. Edinburgh, Scotland. May 23, 2006.
- AIRWeb: Adversarial Information Retrieval on the Web
Secondary Readings
- Terveen, L., W. Hill, et al. (1997). PHOAKS: A System for Sharing Recommendations. Communications of the ACM 40(3): 59-62.
- Balabanovic, M. and Y. Shoham (1997). Fab: Content-Based, Collaborative Recommendation. Communications of the ACM 40(3): 66-72.
- Goldberg, D., D. Nichols, et al. (1992). Using Collaborative Filtering to Weave an Information Tapestry. Communications of the ACM 35(12): 61-70.
Deliverables
- "Future of Search" Paper Due
- Class Discussion: Is Filtering a Necessary Evil? The Costs of Content Filtering
- Topic Review
Week 14 - (Apr. 19, 2007) Federated Search, Meta-Search and Digital Libraries
Primary Readings
- Callery, A. (1996). Yahoo! Cataloging the Web. Untangling the Web Conference, University of California, Santa Barbara.
- Borgman, C.L. (1999) What are Digital Libraries? Competing Visions. Information Processing and Management.
- Hane, Paula J. (2003) The Truth About Federated Searching. Information Today Magazine.
- Linden, Greg (2007) The End of Federated Search?
- Ke, Y., Deng, L., et al. (2005) Web dynamics and their ramifications for the development of Web search engines.
- Kules, B., Kustanowitz, J. & Shneiderman, B. (2006) Categorizing web search results into meaningful and stable categories using fast-feature techniques. Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries. Chapel Hill, NC, USA.
Secondary Readings
- Dogpile (meta search)
- Lossau, Norbert (2004) Search Engine Technology and Digital Libraries: Libraries Need to Discover the Academic Internet. D-Lib Magazine. June 2004, Volume 10 Number 6.
- WebFeat, Inc. (2005) University of Pittsburgh: Custom Branded Zoom! Improves Use of over 200 Scholarly Resources. (Case Study)
- The Shibboleth Project
- Innovative Interfaces: Digital Collections
- Intute (education and research search)
- OAIster (academic search using metadata)
- Online Medical Search (medical metasearch)
- Meng, W., Yu, C., Liu, K. (2002) Building efficient and effective metasearch engines. ACM Computing Surveys, 34(1).
Deliverables
Week 15 - (Apr. 26, 2007) Intranet, Enterprise and Specialized Search
Primary Readings
- Li, H., Cao, Y., et al. (2005) A new approach to intranet search based on information extraction. Proceedings of the 14th ACM international conference on Information and knowledge management (CIKM).
- Hawking, D. (2004) Challenges in enterprise search. Proceedings Fifteenth Australasian Database Conference.
- Rajat Mukherjee & Jianchang Mao. (2004) Enterprise Search: Tough Stuff. ACM Queue Magazine. Volume 2, Issue 2 (April 2004) pp 36-46.
- Stenmark, Dick (2005). One week with a corporate search engine: A time-based analysis of intranet information seeking. Proceedings of AMCIS 2005, Omaha, Nebraska, August 11-14, 2005, pp. 2306-2316.
- Farnum, C. (2007) Tuning up Site Search | Slideshow. Presentation at the Information Architecture Summit. Las Vegas, NV.
Secondary Readings
- OmniFind Enterprise Edition brochure
- Google Enterprise Search Solutions
- Microsoft Enterprise Search for Business Managers
- White paper: Evaluation guide for Office SharePoint Server 2007 for Search
Deliverables
- Class Work: Research Paper or Projects Review
Week 16 - (May 3, 2007) No Class - CHI 2007