INFORMATION RETRIEVAL - 2020/1
Module code: COM2034
Module Overview
This module will provide students with an understanding of information retrieval. This relates to multimedia data (principally text, but also image, video and audio) stored for, presented on, and consumed from, the web amongst other sources. The module covers fundamental techniques and strategies of information retrieval used in a variety of online applications such as web-search engines, document matching systems, and business storage and analytics.
Module provider
Computer Science
Module Leader
GILLAM Lee (Computer Sci)
Number of Credits: 15
ECTS Credits: 7.5
Framework: FHEQ Level 5
Module cap (Maximum number of students): N/A
Overall student workload
Independent Learning Hours: 104
Lecture Hours: 24
Laboratory Hours: 22
Module Availability
Semester 1
Prerequisites / Co-requisites
N/A
Module content
· Retrieval, browsing, user information needs, and other core concerns.
· Notions of structured, unstructured and semi-structured data
· A generic architecture for information retrieval
· Spiders/crawlers, stopwords and keywords, indexing and stemming.
· Boolean retrieval, ranked retrieval, and vector spaces
· Query expansion and its relationship with the Semantic Web.
· Assessing relevance - precision and recall
· Metadata and semantics.
· Databases, data normalization and de-normalization.
· The challenges presented by “Big Data”
· NoSQL and Cloud Computing for distributed and scalable treatment of “Big Data”.
· Image and video features and classifications that enable access to other media types
· Exemplar applications, including web-based search engines, organisation-wide archives, business data collections, and media collections.
Assessment pattern
Assessment type | Unit of assessment | Weighting |
---|---|---|
Coursework | COURSEWORK | 20 |
Examination | EXAM 2 HOURS | 80 |
Alternative Assessment
N/A
Assessment Strategy
The assessment strategy is designed to provide students with the opportunity to demonstrate :
Explaining theories behind search and assess the impacts on search performance inherent in variations in their construction
Elaborating a range of techniques for analysing, modelling, and retrieving text documents
Contrasting different kinds of applications, and their integration, in satisfying specific user information needs
Elaborating, contrasting and evaluating information models that support efficient storage, retrieval and browsing, in a variety of applications.
Contrasting the need for efficiency of data storage with the needs of batch access to large datasets.
Applying appropriate, standard, metadata sets and semantics to ensure effective data storage and curation.
Identifying the important features for storage, retrieval and browsing of non-textual data
Thus, the summative assessment for this module consists of:
A coursework that will involve applying and evaluating various concepts and principles introduced in lectures and tested in lab sessions. Specific software and analytical approaches will be explored in these assessments. Submissions will be made through the VLE, with the deadline towards the end of the module. The coursework may assess against all relevant learning outcomes addressed suitably in advance of the deadline.
2-hour written unseen written examination comprising a mixture of short answer and discussion questions. The examination paper may assess against all learning outcomes.
Formative assessment and feedback
Students will be progressively completing structured lab workbooks where submission of each is necessary to progress to the next. On submission, informative solutions are also provided such that students will be able to gauge their progress as the module progresses.
Module aims
- Help students to gain an understanding of the current study of information retrieval
- Provide practical understanding of how data are represented for storage, analysis and use in particular applications.
Learning outcomes
Attributes Developed | ||
1 | Explain theories behind search and assess the impacts on search performance inherent in variations in their construction | KC |
2 | Elaborate a range of techniques for analysing, modelling, and retrieving text documents | KCT |
3 | Contrast different kinds of applications, and their integration, in satisfying specific user information needs | KCT |
4 | Elaborate, contrast and evaluate information models that support efficient storage, retrieval and browsing, in a variety of applications. | KC |
5 | Contrast the need for efficiency of data storage with the needs of batch access to large datasets | KCT |
6 | Apply appropriate, standard, metadata sets and semantics to ensure effective data storage and curation. | KP |
7 | Identify the important features for storage, retrieval and browsing of non-textual data | KCT |
Attributes Developed
C - Cognitive/analytical
K - Subject knowledge
T - Transferable skills
P - Professional/Practical skills
Methods of Teaching / Learning
The learning and teaching strategy is designed to:
Develop an understanding for the principles and role of information retrieval and closely related applications
The learning and teaching methods include:
· Lectures, including case studies•
· Occasional set reading•
· In-class discussions
· In-class and out-of-class exercises•
· Lab sessions•
· Coursework.
Indicated Lecture Hours (which may also include seminars, tutorials, workshops and other contact time) are approximate and may include in-class tests where one or more of these are an assessment on the module. In-class tests are scheduled/organised separately to taught content and will be published on to student personal timetables, where they apply to taken modules, as soon as they are finalised by central administration. This will usually be after the initial publication of the teaching timetable for the relevant semester.
Reading list
https://readinglists.surrey.ac.uk
Upon accessing the reading list, please search for the module using the module code: COM2034
Programmes this module appears in
Programme | Semester | Classification | Qualifying conditions |
---|---|---|---|
Computer Science BSc (Hons) | 1 | Compulsory | A weighted aggregate mark of 40% is required to pass the module |
Computing and Information Technology BSc (Hons) | 1 | Compulsory | A weighted aggregate mark of 40% is required to pass the module |
Please note that the information detailed within this record is accurate at the time of publishing and may be subject to change. This record contains information for the most up to date version of the programme / module for the 2020/1 academic year.