Line 3: | Line 3: |
= ISBI - Teknik för internetsökning och omvärldsbevakning = | = ISBI - Internet Search Techniques and Business Intelligence (ISBI/IV2002) 7,5 hp, (Teknik för internetsökning och omvärldsbevakning, in Swedish) = |
Line 16: | Line 16: |
Fundamentals of Information Retrieval: Boolean, term-weighting and vector-space text retrieval models; document similarity measures; quality measures - precision and recall; document indexes and their access methods; morphological and semantic analysis in text retrieval. | The course gives an insight into techniques for information retrieval and news monitoring on the Internet. We will study the fundamentals of document processing, text summarisation, search engine principles and algorithms, web optimisation methods, as well as business intelligence applications and extraction of information from news using automatic text summarisation. |
Line 18: | Line 18: |
Query analysis: Processing the search words and the index using word stemming, query expansion, fuzzy matching, compound splitting and compound joining to increase the quality of search. Another technique is automatic translation of search words into other languages to enable cross-language information retrieval. | The course is delivered in English. The students are welcome to use English and Swedish. |
Line 20: | Line 20: |
Information clustering and presentation: Sorting of text flows using automatic and semi-automatic clustering. Automatic document summarization removes redundant information from a document and creates a shorter summary. Multi-document summarization condenses several documents into one. Machine translation is used to present results in the user's native language. | == Goal == The course gives an insight into the techniques for information searching and monitoring applied on the Internet. After the course is finished, the students should be able to: |
Line 22: | Line 23: |
Search Engines: Architecture of a search engine; crawlers and features that hinder crawling; keyword-based retrieval; link analysis and PageRank; optimization of websites for search engines (Search Engine Optimization) and search engine spamming; paid listing; meta-search engines; web directories. Furthermore, there exists authoritative information that is accessible over the Internet but not visible to ordinary search engines. This material resides on the "invisible web", which largely consists of content-rich databases from universities, libraries, associations, businesses, and government agencies. | Compare the models of information retrieval, explain their advantages and disadvantages. |
Line 24: | Line 25: |
Monitoring tools: News archives and indexing tools, news alerts and agents, and RSS based news surveillance tools. | Measure the quality of information retrieval tools. |
Line 26: | Line 27: |
Question-Answering Systems deliver the answer to the question the user has in mind while searching, instead of a ranked list of documents. The three main question-answering approaches are based on Natural Language Processing, Information Retrieval, and question templates. | Explain the principles and algorithms used by major search engines and apply this knowledge for developing one’s own web documents. |
Line 28: | Line 29: |
== Outline == Half-time study pace. Credits (p): 7,5. Lectures: 14 x 2 hours. Assignments: 3. Lab sessions: 3 x 2 hours. |
Explain how Business Intelligence systems work, their strengths and weaknesses. |
Line 35: | Line 31: |
===== Groups ===== Lab sessions are carried out in groups of at most two students, and assignments in groups of at most four students. Lab sessions take place at the university at fixed times under the supervision of the course managers. Assignments are done at home, but there are supervision sessions where students can ask questions and get support from the teacher. |
Make a specification of a Business Intelligence system that fulfills certain requirements. |
Line 40: | Line 33: |
===== Distance students ===== Distance students must be physically present on campus only for the exam; all other tasks can be completed remotely using electronic means of communication. If a distance student cannot form a group, the student is allowed to solve all tasks alone, http://www.dsv.su.se/~eriks/66BI/66BIdist.html. |
Explain and choose language technology tools that increase the quality of document retrieval and filtering. Use the terminology and concepts in information retrieval and business intelligence. http://dsv.su.se/en/education/distance/ISBI/. |
ISBI - Internet Search Techniques and Business Intelligence (ISBI/IV2002) 7,5 hp, (Teknik för internetsökning och omvärldsbevakning, in Swedish)
Aim
The course gives an insight into the techniques for information searching and monitoring applied on the Internet. After the course is finished, the students should be able to:
- Compare the models of information retrieval, explain their advantages and disadvantages.
- Measure the quality of information retrieval tools.
- Explain the principles and algorithms used by major search engines and apply this knowledge for developing one’s own web documents.
- Explain how Business Intelligence systems work, their strengths and weaknesses.
- Make a specification of a Business Intelligence system that fulfills certain requirements.
- Explain and choose language technology tools that increase the quality of document retrieval and filtering.
- Use the terminology and concepts in information retrieval and business intelligence.
Syllabus
The course gives an insight into techniques for information retrieval and news monitoring on the Internet. We will study the fundamentals of document processing, text summarisation, search engine principles and algorithms, web optimisation methods, as well as business intelligence applications and extraction of information from news using automatic text summarisation.
The course is delivered in English. The students are welcome to use English and Swedish.
http://dsv.su.se/en/education/distance/ISBI/.