INFORMATION RETRIEVAL - 2020/1

Module code: COM2034

Module Overview

This module will provide students with an understanding of information retrieval. This relates to multimedia data (principally text, but also image, video and audio) stored for, presented on, and consumed from, the web amongst other sources. The module covers fundamental techniques and strategies of information retrieval used in a variety of online applications such as web-search engines, document matching systems, and business storage and analytics.

Module provider

Computer Science

Module Leader

GILLAM Lee (Computer Sci)

Number of Credits: 15

ECTS Credits: 7.5

Framework: FHEQ Level 5

Module cap (Maximum number of students): N/A

Overall student workload

Independent Learning Hours: 104

Lecture Hours: 24

Laboratory Hours: 22

Module Availability

Semester 1

Prerequisites / Co-requisites

N/A

Module content

·         Retrieval, browsing, user information needs, and other core concerns.

·         Notions of structured, unstructured and semi-structured data

·         A generic architecture for information retrieval

·         Spiders/crawlers, stopwords and keywords, indexing and stemming.

·         Boolean retrieval, ranked retrieval, and vector spaces

·         Query expansion and its relationship with the Semantic Web.

·         Assessing relevance - precision and recall

·         Metadata and semantics.

·         Databases, data normalization and de-normalization.

·         The challenges presented by “Big Data”

·         NoSQL and Cloud Computing for distributed and scalable treatment of “Big Data”.

·         Image and video features and classifications that enable access to other media types

·         Exemplar applications, including web-based search engines, organisation-wide archives, business data collections, and media collections. 

Assessment pattern

Assessment type Unit of assessment Weighting
Coursework COURSEWORK 20
Examination EXAM 2 HOURS 80

Alternative Assessment

N/A

Assessment Strategy

The assessment strategy is designed to provide students with the opportunity to demonstrate :



  • Explaining theories behind search and assess the impacts on search performance inherent in variations in their construction


  • Elaborating a range of techniques for analysing, modelling, and retrieving text documents


  • Contrasting different kinds of applications, and their integration, in satisfying specific user information needs


  • Elaborating, contrasting and evaluating information models that support efficient storage, retrieval and browsing, in a variety of applications.


  • Contrasting the need for efficiency of data storage with the needs of batch access to large datasets.


  • Applying appropriate, standard, metadata sets and semantics to ensure effective data storage and curation.


  • Identifying the important features for storage, retrieval and browsing of non-textual data



 

Thus, the summative assessment for this module consists of:



  • A coursework that will involve applying and evaluating various concepts and principles introduced in lectures and tested in lab sessions. Specific software and analytical approaches will be explored in these assessments. Submissions will be made through the VLE, with the deadline towards the end of the module. The coursework may assess against all relevant learning outcomes addressed suitably in advance of the deadline.


  • 2-hour written unseen written examination comprising a mixture of short answer and discussion questions. The examination paper may assess against all learning outcomes.



 

Formative assessment and feedback

Students will be progressively completing structured lab workbooks where submission of each is necessary to progress to the next. On submission, informative solutions are also provided such that students will be able to gauge their progress as the module progresses.

Module aims

  • Help students to gain an understanding of the current study of information retrieval
  • Provide practical understanding of how data are represented for storage, analysis and use in particular applications.

Learning outcomes

Attributes Developed
1 Explain theories behind search and assess the impacts on search performance inherent in variations in their construction KC
2 Elaborate a range of techniques for analysing, modelling, and retrieving text documents KCT
3 Contrast different kinds of applications, and their integration, in satisfying specific user information needs KCT
4   Elaborate, contrast and evaluate information models that support efficient storage, retrieval and browsing, in a variety of applications. KC
5 Contrast the need for efficiency of data storage with the needs of batch access to large datasets KCT
6 Apply appropriate, standard, metadata sets and semantics to ensure effective data storage and curation. KP
7  Identify the important features for storage, retrieval and browsing of non-textual data KCT

Attributes Developed

C - Cognitive/analytical

K - Subject knowledge

T - Transferable skills

P - Professional/Practical skills

Methods of Teaching / Learning

The learning and teaching strategy is designed to:

Develop an understanding for the principles and role of information retrieval and closely related applications

 

The learning and teaching methods include:

·         Lectures, including case studies•  

·         Occasional set reading•  

·         In-class discussions

·         In-class and out-of-class exercises•  

·         Lab sessions•  

·         Coursework.

Indicated Lecture Hours (which may also include seminars, tutorials, workshops and other contact time) are approximate and may include in-class tests where one or more of these are an assessment on the module. In-class tests are scheduled/organised separately to taught content and will be published on to student personal timetables, where they apply to taken modules, as soon as they are finalised by central administration. This will usually be after the initial publication of the teaching timetable for the relevant semester.

Reading list

https://readinglists.surrey.ac.uk
Upon accessing the reading list, please search for the module using the module code: COM2034

Programmes this module appears in

Programme Semester Classification Qualifying conditions
Computer Science BSc (Hons) 1 Compulsory A weighted aggregate mark of 40% is required to pass the module
Computing and Information Technology BSc (Hons) 1 Compulsory A weighted aggregate mark of 40% is required to pass the module

Please note that the information detailed within this record is accurate at the time of publishing and may be subject to change. This record contains information for the most up to date version of the programme / module for the 2020/1 academic year.