ISBI - Internet Search Techniques and Business Intelligence (ISBI/IV2002), 7.5 hp (in Swedish: Teknik för internetsökning och omvärldsbevakning)
Requirements
60 hp in Computer and Systems Sciences
Short description
The course covers techniques for information retrieval and analysis on the Internet. We study the fundamentals of document retrieval; search engine principles and algorithms, including website optimization; web mining, including analysis of behavior patterns and online reviews; and tools for monitoring online news. The course offers non-mandatory campus lectures and is also available as a distance education version.
Aim
The course gives insight into the techniques for searching and monitoring information on the Internet. After completing the course, students should be able to:
- Compare the models of information retrieval and explain their advantages and disadvantages.
- Measure the quality of information retrieval tools.
- Explain the principles and algorithms used by major search engines and apply this knowledge when developing one’s own web documents.
- Explain how Business Intelligence systems work and describe their strengths and weaknesses.
- Write a specification for a Business Intelligence system that fulfills given requirements.
- Explain and choose language technology tools that increase the quality of document retrieval and filtering.
- Use the terminology and concepts of information retrieval and business intelligence.
Syllabus
The course gives an insight into techniques for information retrieval and news monitoring on the Internet. We will study the fundamentals of document processing, text summarisation, search engine principles and algorithms, web optimisation methods, as well as business intelligence applications and extraction of information from news using automatic text summarisation.
The course is delivered in English. Students may use either English or Swedish.
http://dsv.su.se/en/education/distance/ISBI/.