Welcome to the Big Data Management Group at the TU Berlin!
Key attributes of Big Data can be described by the three (or more) "V's": Big Volume, Big Velocity, and Big Variety. In this group, we mainly focus on the last "V", the Big Variety of data:
To use and combine data from e-commerce, sensors, and social media services, integration and curation routines have to be employed. The heterogeneity of data impedes the seamless integration of different sources, requiring human intervention in the form of exhaustive profiling and data preparation efforts. Hence, research on Big Data calls for scalable data profiling and integration systems that enable the curation and consumption of many large and diverse data sources.
Along with the profiling and integration of large datasets, the deployment of sophisticated analytics on data (big analytics) is closely related to the above-mentioned problem. We are interested in systems that leverage mining and machine learning techniques to derive knowledge from dirty and poorly organized data. This includes developing sketching and summarizing techniques that reduce a big dataset to its relevant core information.
News
Paper accepted for SIGMOD 2020
Thursday, 30 April 2020
Our paper in collaboration with the DIMA group was accepted for SIGMOD 2020.
Raha Won the ACM SIGMOD Most Reproducible Paper Award
Wednesday, 6 May 2020
Paper Accepted for Informatik Spektrum Journal
Friday, 28 February 2020
Our paper "Data Science für alle: Grundlagen der Datenprogrammierung" has been published in the Informatik Spektrum journal.
Larysa defended her PhD
Thursday, 27 February 2020
BigDaMa graduated its first PhD student.
Paper accepted for JDIQ
Friday, 8 November 2019
Our paper "Anatomy of Metadata for Data Curation" was accepted for publication in JDIQ.
Paper Accepted for the Gongshow Presentation at CIDR 2020
Thursday, 24 October 2019
Our paper was accepted for the Gongshow Presentation at CIDR 2020.
Paper Accepted for Datenbank-Spektrum 2019
Friday, 30 August 2019
Our paper was accepted for Datenbank-Spektrum 2019.
Paper Accepted for CIKM 2019
Saturday, 10 August 2019
Our paper was accepted for CIKM 2019.
Ziawasch Abedjan selected as GI Juniorfellow
Wednesday, 31 July 2019
Paper Accepted for LWDA 2019
Thursday, 25 July 2019
Our paper was accepted for LWDA 2019.
Paper Accepted for SIGMOD 2019
Sunday, 5 May 2019
Our paper on a configuration-free error detection system was accepted for SIGMOD 2019.
Paper Accepted for SSDBM 2019
Sunday, 5 May 2019
Our paper on estimating error detection performance was accepted for SSDBM 2019.
First Prize at BTW Data Science Challenge 2019
Thursday, 7 March 2019
Mahdi Esmailoghli and his team won the BTW Data Science Challenge in 2019.
Short Paper accepted for EDBT 2019
Sunday, 30 December 2018
We are happy to announce that our short paper "Feature Engineering for Cross-Language Record Linkage" by Öykü Özlem Çakal, Mohammad Mahdavi, and Ziawasch Abedjan was accepted for presentation at EDBT 2019.
Paper accepted for ICDE 2019
Monday, 17 December 2018
We are happy to announce that our research paper "Unsupervised String Transformation Learning for Entity Consolidation" was accepted for publication in the ICDE 2019 proceedings.
New Book on Data Profiling
Friday, 23 November 2018
We are happy to announce that our book "Data Profiling", written by Ziawasch Abedjan, Lukasz Golab, Felix Naumann, and Thorsten Papenbrock, has been published by Morgan & Claypool and is available for purchase.
Larysa receives PAS Scholarship
Friday, 21 September 2018
Larysa Visengeriyeva received a PAS scholarship, which is awarded to female researchers in the final phase of their dissertation.
Visit by Michael Stonebraker
Friday, 21 September 2018
Michael Stonebraker visits the BigDaMa group at the TU Berlin and gives a talk in the BBDC Seminar.
Chapter published in eBISS 2017
Monday, 27 August 2018
Prof. Abedjan contributed a chapter to the seventh volume of the eBISS series, titled "Business Intelligence and Big Data".
Contact
Prof. Dr. Ziawasch Abedjan
Big Data Management
Faculty of EECS (IV)
Building EN 7
Room EN 723
Einsteinufer 17
10587 Berlin
+49 30 314 28007
+49 30 31421601
Mon-Fri