Course Details

Course Name
Ingénierie documentaire
Course Type
Course
Institution
Université de Lausanne
Department
Department of Language and Information Sciences
Information
https://applicationspub.unil.ch/interpub/noauth/php/Ud/ficheCours.php?v_semposselected=156&v_langue=en&v_isinterne=&v_enstyid=64491
Curriculum
-
Status
record actively maintained
Course Language
French
Start Date
2017-09-21
recurring
ECTS
6
Lecturer
Michael Piotrowski
PID
Access Requirements
-
Description
Objective: Students learn how to use the UNIX shell and the standard text processing tools, as well as regular expressions. They obtain an understanding of XML and its representation by the Document Object Model. They learn how to use XSLT for transforming and XQuery for querying XML documents. Students also get an overview of related technologies.
Content: Document engineering is the computer science discipline that investigates systems for documents in any form and in all media. As with the relationship between software engineering and software, document engineering is concerned with principles, tools, and processes that improve our ability to create, manage, and maintain documents. This course focuses on two aspects important for many digital humanities projects:
1. The manipulation of plain text using the standard UNIX text processing tools.
2. XML and related technologies for modeling, processing, querying, and transforming complex structured documents.
Keywords
Disciplines: Computer Science, Human Language Technologies
Techniques: Pattern Recognition, Text Mining
Objects: Text, Standards, Tools