Data pipelines powered by Open source and fun

The world of data is ever-growing, and there are ample technologies on the market. In this session, we will see some beautiful open source tools in action. Knowing the tools is only the first step, but it is an important one. This session is best suited for beginners keen to dive into the data world.

Typical data engineering actions –
-1) Connecting with data sources ⚑️
0) Pre-processing 🀫
1) Ingestion 🛠
2) Business logic 🙋🏻‍♀️
3) Dashboarding 👓
4) Lift and shifts 💪🏽
5) Quality control and statistical checks ☑️

* Everything else follows…
Each step often brings its own tooling need. Open source is a great way to get started and to flatten steep learning curves.

Speaker

  • Sayantika Banik
    Quansight

    Sayantika is a D&I advocate, open-source contributor, and data engineer at Quansight. Working with data is her core strength, and giving back to the community is very important to her. More about Sayantika's work and contributions – sayantikabanik.com, GitHub

Date

Jul 20 2022

Time

11:15 - 11:45

Location

Room Barcelona I