AUDIO AND VIDEO PROCESSING - 2024/5

Module code: EEE3042

Module Overview

Expected prior learning: Mathematics equivalent to Year 2 of EEE Programmes.

Module purpose: Using lectures and examples, this module will provide the fundamental concepts needed to understand the operation of modern media processing and transmission systems delivering high quality audio-visual content.

Module EEE3042 contributes to the development of student’s knowledge in audio and video processing and coding, which may be useful for their taking of other modules such as:


  • EEEM030 Speech & Audio Processing and Recognition

  • EEE3032 Computer Vision and Pattern Recognition

  • EEE3008 Digital Signal Processing

  • EEEM071 Advanced Topics in Computer Vision and Deep Learning



Module EEE3042 has components on audio processing which provides useful knowledge for the EEEM030 module. The EEE3042 module discuss the applications of digital signal processing and thus the knowledge learned in this module can help reinforce student’s understanding of the fundamental concepts and theories learned from the module EEE3008. The video processing knowledge gained in EEE3042 will be useful for student to learn computer vision modules such as EEE3032 and EEEM071.

Module EEE3042 also benefits from knowledge gained from other modules such as:


  • EEE3008 Digital Signal Processing

  • EEE1033 Computer and Digital Logic

  • EEE1035 Programming in C



The modules EEE1033 and EEE1035 provide students with useful skills in programming, which will be beneficial for them to complete the computer programming based coursework components, by turning the signal processing theories and methods into working program codes. EEE3008 discuss fundamental concepts and methods in digital signal processing which will enable students to use such knowledge to audio and video data.

Module provider

Computer Science and Electronic Eng

Module Leader

WANG Wenwu (CS & EE)

Number of Credits: 15

ECTS Credits: 7.5

Framework: FHEQ Level 6

Module cap (Maximum number of students): N/A

Overall student workload

Independent Learning Hours: 87

Lecture Hours: 33

Guided Learning: 5

Captured Content: 25

Module Availability

Semester 2

Prerequisites / Co-requisites

None.

Module content

 

Part A – Audio processing

[1-2]     Characteristics of speech and audio signals, and audio processing chain.

[3-4]     Principles of audio capture and coding methods. Need for source compression. Definition of quantisation. Scalar and vector quantisation.

[5-6]  Probability density function, auto-correlation, cross-correlation, stationarity etc.

[7-9]     Fundamental methods of speech processing. Voiced and unvoiced speech. Effects of windowing. Short and long term modelling of speech.

[10-12] Simple coding methods (PCM, APCM, ADPCM). Analysis and Synthesis coding methods. Analysis by synthesis coding methods. Harmonic coding.

[13-14] Audio reproduction formats and coding technologies, and spatial audio and 3D audio systems. File formats and standards.

[15]      Perceptual evaluation of audio quality

 

Part B – Video processing

[1-2]     Introduction to 2D-3D video applications and systems

[3-5]     Video capture and coding principles. Image and video formats. Spatial and temporal redundancy. DCT. Motion estimation and compensation. Coding complexity. RD optimisation.

[6-9]     Networked video concepts: processing, transmission/communications, and rendering.

[10-12] Media adaptation concepts: adaptation decision and content adaptation

[13-14] Introduction to 3D video compression. Depth perception cues. 3D video signal formats. 3D video compression approaches.

[15]      Perceptual video quality assessment

Assessment pattern

Assessment type Unit of assessment Weighting
Coursework COURSEWORK 1 15
Coursework COURSEWORK 2 15
Examination INVIGILATED (CLOSE BOOK) EXAM WITHIN 2HR WINDOW 70

Alternative Assessment

N/A

Assessment Strategy

The assessment strategy for this module is designed to provide students with the opportunity to demonstrate the level of achievement made towards reaching the expected learning outcomes. Two course works are designed which provide students with opportunities to apply the concepts learned in classes and exercise on some audio/video processing and programming skills. A written examination has been designed that will assess the knowledge and assimilation of the necessary definitions and terminology, concepts and theory of 2D and 3D audio-visual content capture, compression, transmission, adaptation, media quality evaluation, and users’ experience. The examination also aims to assess the students’ ability to analyse common media processing problems and apply necessary solutions in accordance with their understanding of how media processing can address those challenges and limitations that can be faced in real-world situations.

Thus, the summative assessment for this module consists of the following.


  • Coursework 1 (15%)



           This coursework is designed for the audio processing part, e.g. DFT transform, DCT transform, etc.


  • Coursework 2 (15%)



           This coursework is designed for the video processing part, e.g. image/video coding, etc.


  • Written exam (70%)



These deadlines are indicative. For confirmation of exact date and time, please check the Departmental assessment calendar issued to you.

 

Formative assessment and feedback

For the module, students will receive formative assessment/feedback in the following ways.


  • During lectures, by question and answer sessions

  • During revision & tutorial classes

  • By means of unassessed tutorial problem sheets



 

Module aims

  • Provide students with a basic understanding of the media signal processing systems that are ubiquitous in our modern age.
  • Introduce them to the essential characteristics of the signals, their capturing, encoding, transmission and assessment of their perceived quality.
  • The module also aims to provide opportunities for students to learn about the Surrey Pillars listed below.

Learning outcomes

Attributes Developed
Ref
001 Demonstrate a knowledge of the nature of audio and video signals. K C1
002 Identify the processing, transmission and rendering stages of networked content distribution. KCT C2
003 Demonstrate an understanding of the practical implementation issues related to signal compression for communication applications CT C4,C5
004 Construct knowledge on important features of different compression standards and identify their future requirements for new applications KC C7
005 Describe and discuss the principles of media adaptation through the interaction between system capability and user experience. KC C6
006 Understand the differences of the assessment methods for perceived audio and video quality and report on them in written format KCPT C13, C16, C17

Attributes Developed

C - Cognitive/analytical

K - Subject knowledge

T - Transferable skills

P - Professional/Practical skills

Methods of Teaching / Learning

The learning and teaching strategy is designed to achieve the following aims.


  • Directive learning through classroom teaching

  • Tutorial to enable students to check and clarify their understanding of the material



 

Learning and teaching methods include the following.

Lectures (3-hour lectures per week x 10 weeks)

Indicated Lecture Hours (which may also include seminars, tutorials, workshops and other contact time) are approximate and may include in-class tests where one or more of these are an assessment on the module. In-class tests are scheduled/organised separately to taught content and will be published on to student personal timetables, where they apply to taken modules, as soon as they are finalised by central administration. This will usually be after the initial publication of the teaching timetable for the relevant semester.

Reading list

https://readinglists.surrey.ac.uk
Upon accessing the reading list, please search for the module using the module code: EEE3042

Other information

The Curriculum Framework at Surrey is committed to developing graduates with strengths in five pillars: Digital Capabilities, Employability, Sustainability, Global and Cultural Capabilities, and Resourcefulness and Resilience. This module is designed to allow students to develop knowledge, skills and capabilities in the following areas:

Digital capabilities: Students will develop skills in applications of digital signal processing methods to audio and video, which is a key digital technology in electronic engineering. They will gain practical skills in audio and video processing experience via coursework components with exercise of implementing signal processing algorithms via computing programming such as MATLAB.

Employability: This module provides foundational skills in digital signal processing, audio processing (such as audio capture, time frequency analysis, audio coding, audio perception, and audio reproduction), video processing (such as video coding, network coding, media adaptation, and 3D media), which are all important topics for a wide range of industry applications such as image/video processing, speech processing, digital communications, robotic audition and video, and artificial intelligence.

Sustainability: This module discusses the issues of efficiency in audio/video signal storage/transmission, and discusses a range of compression and coding techniques, which are important for sustainable use of communication bandwidth in digital communications, sustainable use of digital storage space, and sustainable use of computing resources.

Resourcefulness and Resilience: This module develops student skills in using the audio and video processing methods they have learned in lecture material to solve practical problems designed in the tutorial questions, exams, and computer programming based coursework.

Please note that the information detailed within this record is accurate at the time of publishing and may be subject to change. This record contains information for the most up to date version of the programme / module for the 2024/5 academic year.