Modiwl ICE-4721:
Natural Language Processing
Natural Language Processing 2024-25
ICE-4721
2024-25
School Of Computer Science And Electronic Engineering
Module - Semester 2
20 credits
Module Organiser:
William Teahan
Overview
Indicative content includes:
- NLP programming in Python using the NLTK and text processing methods using Unix tools such as sed, awk.
- Topics in Computational Linguistics and Natural Language Processing: Zipf’s Law; Heap’s Law; the sparse data problem; the zero frequency problem; regular expressions; n-grams; language modelling; NLP pipeline e.g. tokenization -> word segmentation -> part of speech (POS) tagging -> phrase chunking -> parsing -> named entity recognition -> information extraction -> question answering.
- Topics in Information Retrieval (IR) – systems, theory and technologies: Boolean search; textual conflation e.g. stemming and stopword removal; probabilistic IR model; Vector Space IR model; relevance feedback; text categorization and information filtering; information extraction; question answering; IR systems and search engines e.g. Google’s PageRank, IBM’s Web Fountain; search engine optimization; evaluation.
Assessment Strategy
-threshold -Equivalent to 50%.Uses key areas of theory or knowledge to meet the Learning Outcomes of the module. Is able to formulate an appropriate solution to accurately solve tasks and questions. Can identify individual aspects, but lacks an awareness of links between them and the wider contexts. Outputs can be understood, but lack structure and/or coherence. -good -Equivalent to the range 60%-69%.Is able to analyse a task or problem to decide which aspects of theory and knowledge to apply. Solutions are of a workable quality, demonstrating understanding of underlying principles. Major themes can be linked appropriately but may not be able to extend this to individual aspects. Outputs are readily understood, with an appropriate structure but may lack sophistication. -excellent -Equivalent to the range 70%+.Assemble critically evaluated, relevant areas of knowledge and theory to constuct professional-level solutions to tasks and questions presented. Is able to cross-link themes and aspects to draw considered conclusions. Presents outputs in a cohesive, accurate, and efficient manner.
Learning Outcomes
- Compare and contrast NLP methods and technologies and how they can be applied to constructing non-trivial systems that process natural language.
- Devise a non-trivial NLP system using the NLTK in Python and Unix-based text processing.
- Separate techniques used for Computational Linguistics, Natural Language Processing, and Information Retrieval.
Assessment method
Coursework
Assessment type
Summative
Description
NLP Laboratories
Weighting
50%
Due date
05/05/2023
Assessment method
Coursework
Assessment type
Summative
Description
NLP Assignment
Weighting
30%
Due date
26/05/2023
Assessment method
Exam (Centrally Scheduled)
Assessment type
Summative
Description
NLP Test
Weighting
20%
Due date
31/03/2023