filmeu

Class: Data Engineering for Data Science (Engenharia de Dados para Data Science)

  • Presentation

    This course focuses on data engineering and its intersection with data science. Students are expected to gain technical skills in several independent but related topics. The most relevant areas are databases and programming, the fundamental skills needed to work as a data engineer in academic or industrial projects. The course is included in the master's program because data collection, validation and processing skills are needed to produce data that can be explored with the knowledge acquired in the other curricular units.

  • Code

    ULHT457-2-25062
  • Syllabus

    1. Introduction to Data Engineering
    2. Git & GitHub
      1. Introduction to version control systems
      2. Learning elementary work processes using the Git software and the GitHub online platform
    3. Databases & SQL
      1. Relational databases
      2. SQL language
      3. SQL injection (elementary notions)
    4. Python Programming
      1. From the data extraction and data processing points of view
      2. From the exploratory data analysis point of view
    5. Algorithmic complexity and efficiency
      1. Its importance when dealing with large amounts of data
      2. Jupyter notebook
    6. Linux
      1. Introduction to the GNU/Linux operating system
      2. File system navigation (commands)
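
    As an illustration of how the SQL injection and Python topics above connect, the following is a minimal sketch, not course material. It assumes Python's built-in sqlite3 module in place of a server database such as MySQL, and the `students` table, the sample names, and the functions `find_unsafe` and `find_safe` are hypothetical, introduced only for this example. It contrasts a query built by string formatting with a parameterized query.

```python
import sqlite3

# Assumption: an in-memory SQLite database stands in for a real server.
# The `students` table and its rows are hypothetical sample data.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE students (id INTEGER PRIMARY KEY, name TEXT)")
conn.executemany("INSERT INTO students (name) VALUES (?)", [("Ana",), ("Rui",)])


def find_unsafe(name):
    # Vulnerable: the user input is pasted directly into the SQL text,
    # so quotes inside `name` can change the meaning of the query.
    query = f"SELECT name FROM students WHERE name = '{name}'"
    return [row[0] for row in conn.execute(query)]


def find_safe(name):
    # Parameterized: the value is passed separately from the SQL text,
    # so it is always treated as data, never as SQL.
    query = "SELECT name FROM students WHERE name = ?"
    return [row[0] for row in conn.execute(query, (name,))]


malicious = "x' OR '1'='1"
unsafe_rows = find_unsafe(malicious)  # the injected OR clause matches every row
safe_rows = find_safe(malicious)      # no student has this literal name
```

    With the malicious input, the formatted query returns every row in the table, while the parameterized query returns none, because the whole string is compared as a literal name.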
  • Objectives

    Students are expected to learn technical skills related to:

    - Version control (Git & GitHub)

    - Relational databases (e.g. MySQL) and SQL

    - Programming with the Python language, focused on the interaction with relational databases

    - Elementary notions of algorithmic complexity and efficiency

    - Elementary notions of the Linux operating system from an end user's perspective

    It is also expected that the students improve their creativity and critical thinking skills.

  • Teaching methodologies and assessment

    Theoretical-practical classes with exposition of theory and presentation of practical examples.

    Exercises carried out during class, with the support and validation of the teacher.

    Exercises to do at home.

    Assessment: 3 mini-tests and a project

  • References

    • Damas, Luís. SQL - Structured Query Language. 14th edition. Portugal: FCA, 2017. ISBN: 9789727228294
