direkt zum Inhalt springen

direkt zum Hauptnavigationsmenü

Sie sind hier

TU Berlin

Inhalt des Dokuments

Welcome to the Big Data Management Group at the TU Berlin!

Key attributes of Big Data can be described by the three (or several) "V's": Big Volume, Big Velocity, and Big Variety. In this group, we mainly focus on the last "V", the Big variety of data: 

To use and combine data from E-commerce, sensors, and social media services, integration and curation routines have to be employed. The heterogeneity of data impedes the seamless integration of different sources, requiring human intervention in form of exhaustive profiling and data preparation efforts. Hence, research on Big Data calls for scalable data profiling and integration systems that enable curation and consumption of large and many and diverse data sources.

Along with profiling and integration of large datasets, the deployment of sophisticated analytics on data (big analytics [1]) is strongly related to the above mentioned problem. We are interested in systems that leverage mining and machine learning techniques to derive knowledge from dirty and poorly organized data. This includes developing sketching and summarizing techniques that reduce a big dataset to its relevant core information.

News

Larysa defended her PhD [2]

Thursday, the 27. February 2020

BigDaMa graduated their first PhD student. more to: Larysa defended her PhD [3]

Paper accepted for JDIQ [4]

Friday, the 08. November 2019

Our paper on "Anatomy of metadata for Data Curation" was accepted for publication at JDIQ. more to: Paper accepted for JDIQ [5]

Paper Accepted for the Gongshow Presentation at CIDR 2020 [6]

Thursday, the 24. October 2019

Our paper was accepted for the Gongshow Presentation at CIDR 2020. more to: Paper Accepted for the Gongshow Presentation at CIDR 2020 [7]

Paper Accepted for Datenbank-Spektrum 2019 [8]

Friday, the 30. August 2019

Our paper was accepted for Datenbank-Spektrum 2019. more to: Paper Accepted for Datenbank-Spektrum 2019 [9]

Paper Accepted for CIKM 2019 [10]

Saturday, the 10. August 2019

Our paper was accepted for CIKM 2019. more to: Paper Accepted for CIKM 2019 [11]

Ziawasch Abedjan selected as GI Juniorfellow [12]

Wednesday, the 31. July 2019

more to: Ziawasch Abedjan selected as GI Juniorfellow [13]

Paper Accepted for LWDA 2019 [14]

Thursday, the 25. July 2019

Our paper was accepted for LWDA 2019. more to: Paper Accepted for LWDA 2019 [15]

Paper Accepted for SIGMOD 2019 [16]

Sunday, the 05. May 2019

Our paper on the configuration-free error detection system was accepted for SIGMOD 2019. more to: Paper Accepted for SIGMOD 2019 [17]

Paper Accepted for SSDBM 2019 [18]

Sunday, the 05. May 2019

Our paper on the estimating the error detection performance was accepted for SSDBM 2019. more to: Paper Accepted for SSDBM 2019 [19]

First Prize at BTW Data Science Challenge 2019 [20]

Thursday, the 07. March 2019

Mahdi Esmailoghli and his team won the BTW Data Science Challenge in 2019. more to: First Prize at BTW Data Science Challenge 2019 [21]

Short Paper accepted for EDBT 2019 [22]

Sunday, the 30. December 2018

We are happy to announce that our short paper "Feature Engineering for Cross-Language Record Linkage" by Öykü Özlem Çakal, Mohammad Mahdavi and Ziawasch Abedjan was accepted for presentation at EDBT 2019. more to: Short Paper accepted for EDBT 2019 [23]

Paper accepted for ICDE 2019 [24]

Monday, the 17. December 2018

We are happy to announce that our research paper "Unsupervised String Transformation Learning for Entity Consolidation" was accepted for publication in the ICDE 2018 proceedings. more to: Paper accepted for ICDE 2019 [25]

New Book on Data Profiling [26]

Friday, the 23. November 2018

We are happy to announce that our book on "Data Profiling", written by Ziawasch Abedjan, Lukasz Golab, Felix Naumann, and Thorsten Pappenbrock, is published by Morgan and Claypool and available for purchase. more to: New Book on Data Profiling [27]

Larysa receives PAS Scholarship [28]

Friday, the 21. September 2018

Larysa Visengeriyeva received a PAS scholarship that is awarded to female researchers in the final phase of their dissertation. more to: Larysa receives PAS Scholarship [29]

Visit by Michael Stonebraker [30]

Friday, the 21. September 2018

Michael Stonebraker visits the BigDaMa group at the TU Berlin and gives a talk in the BBDC Seminar. more to: Visit by Michael Stonebraker [31]

Chapter published in eBISS 2017 [32]

Monday, the 27. August 2018

Prof. Abedjan contributed one chapter to the sevenths volume of the eBISS series titled "Business Intelligence and Big Data". more to: Chapter published in eBISS 2017 [33]

Paper accepted for SSDBM 2018 [34]

Monday, the 14. May 2018

Our paper on Metadata-Driven Error Detection was accepted for SSDB 2018. more to: Paper accepted for SSDBM 2018 [35]

Paper accepted for ICDE 2018 [36]

Tuesday, the 13. February 2018

Our paper on data discovery was accepted for ICDE 2018. more to: Paper accepted for ICDE 2018 [37]

Demo Paper accepted at ICDE [38]

Sunday, the 24. December 2017

Our proposal to demonstrate the workflow generation of data civilizer was accepted at ICDE 2018. more to: Demo Paper accepted at ICDE [39]

DFG Grant: Tractable Curation Workflows [40]

Friday, the 03. November 2017

We are pleased to announce that the DFG is supporting our research ... more to: DFG Grant: Tractable Curation Workflows [41]

Contact

Prof. Dr. Ziawasch Abedjan
Big Data Management
Faculty of EECS (IV)
Building EN 7
Room EN 723
Einsteinufer 17
10587 Berlin
+49 30 314 28007
+49 30 31421601
e-mail query [42]
Mo-Fr
------ Links: ------

Zusatzinformationen / Extras

Quick Access:

Schnellnavigation zur Seite über Nummerneingabe

Auxiliary Functions

Copyright TU Berlin 2008