-
Presentation
This course focuses on "data engineering" and its intersection with "data science". Students are expected to gain technical skills in several independent but related topics. The most relevant areas are databases and programming, the fundamental skills needed to work as a "data engineer" in academic or industrial projects. The course is included in the master's program because data collection, validation, and processing skills are needed to produce data that can be "explored" with the knowledge acquired in the other curricular units.
-
Class from course
-
Degree | Semesters | ECTS
Master's Degree | One semester | 6
-
Year | Nature | Language
1 | Mandatory | Portuguese
-
Code
ULHT6347-23271
-
Prerequisites and corequisites
Not applicable
-
Professional Internship
No
-
Syllabus
- Introduction to Data Engineering
- Git & GitHub
  - Introduction to version control systems
  - Learning elementary workflows using the Git software and the GitHub online platform
- Databases & SQL
  - Relational databases
  - SQL language
  - SQL injection
- Programming with PHP
  - From the data extraction and data processing points of view
- Algorithmic complexity and efficiency
  - Its importance when dealing with large amounts of data
- Programming with Python
  - From the exploratory data analysis point of view
  - Jupyter Notebook
  - Production of packages for publishing and distributing software ("deployment")
- Linux
  - Introduction to the GNU/Linux operating system
  - File system navigation (commands)
  - Commands for process control
- Advanced Data Engineering Tools
  - Introductory presentation of advanced tools such as:
    - Hadoop
    - Cassandra
-
Objectives
Students are expected to learn technical skills related to:
- Version control (Git & GitHub)
- Relational databases (e.g. MySQL) and SQL
- Programming with the PHP language, focused on the interaction with relational databases
- Programming with the Python language
- Elementary notions of algorithmic complexity and efficiency
- Linux
Students are also expected to improve their creativity and critical thinking skills.
-
Teaching methodologies and assessment
Theoretical-practical classes with exposition of theory and presentation of practical examples.
Exercises to be carried out during the class, with the support and validation of the Teacher.
Exercises to do at home.
Assessment: 3 mini-tests and a project
-
References
Not applicable.
-
Office Hours
Office hours are arranged by appointment on a case-by-case basis. Students should contact the professor by e-mail (bcipriano@ulusofona.pt), explaining the reason for the request. Depending on the availability of the professor and the student, support may be provided by e-mail, by video conference, or in person if the professor considers it warranted.
-
Mobility
No