direkt zum Inhalt springen

direkt zum Hauptnavigationsmenü

Sie sind hier

TU Berlin

Inhalt des Dokuments

Welcome to the Big Data Management Group at the TU Berlin!

Key attributes of Big Data can be described by the three (or several) "V's": Big Volume, Big Velocity, and Big Variety. In this group, we mainly focus on the last "V", the Big variety of data: 

To use and combine data from E-commerce, sensors, and social media services, integration and curation routines have to be employed. The heterogeneity of data impedes the seamless integration of different sources, requiring human intervention in form of exhaustive profiling and data preparation efforts. Hence, research on Big Data calls for scalable data profiling and integration systems that enable curation and consumption of large and many and diverse data sources.

Along with profiling and integration of large datasets, the deployment of sophisticated analytics on data (big analytics) is strongly related to the above mentioned problem. We are interested in systems that leverage mining and machine learning techniques to derive knowledge from dirty and poorly organized data. This includes developing sketching and summarizing techniques that reduce a big dataset to its relevant core information.

News

Paper Accepted for SIGMOD 2019

Sunday, the 05. May 2019

Our paper on the configuration-free error detection system was accepted for SIGMOD 2019. more to: Paper Accepted for SIGMOD 2019

Paper Accepted for SSDBM 2019

Sunday, the 05. May 2019

Our paper on the estimating the error detection performance was accepted for SSDBM 2019. more to: Paper Accepted for SSDBM 2019

First Prize at BTW Data Science Challenge 2019

Thursday, the 07. March 2019

Mahdi Esmailoghli and his team won the BTW Data Science Challenge in 2019. more to: First Prize at BTW Data Science Challenge 2019

Short Paper accepted for EDBT 2019

Sunday, the 30. December 2018

We are happy to announce that our short paper "Feature Engineering for Cross-Language Record Linkage" by Öykü Özlem Çakal, Mohammad Mahdavi and Ziawasch Abedjan was accepted for presentation at EDBT 2019. more to: Short Paper accepted for EDBT 2019

Paper accepted for ICDE 2019

Monday, the 17. December 2018

We are happy to announce that our research paper "Unsupervised String Transformation Learning for Entity Consolidation" was accepted for publication in the ICDE 2018 proceedings. more to: Paper accepted for ICDE 2019

New Book on Data Profiling

Friday, the 23. November 2018

We are happy to announce that our book on "Data Profiling", written by Ziawasch Abedjan, Lukasz Golab, Felix Naumann, and Thorsten Pappenbrock, is published by Morgan and Claypool and available for purchase. more to: New Book on Data Profiling

Larysa receives PAS Scholarship

Friday, the 21. September 2018

Larysa Visengeriyeva received a PAS scholarship that is awarded to female researchers in the final phase of their dissertation. more to: Larysa receives PAS Scholarship

Visit by Michael Stonebraker

Friday, the 21. September 2018

Michael Stonebraker visits the BigDaMa group at the TU Berlin and gives a talk in the BBDC Seminar. more to: Visit by Michael Stonebraker

Chapter published in eBISS 2017

Monday, the 27. August 2018

Prof. Abedjan contributed one chapter to the sevenths volume of the eBISS series titled "Business Intelligence and Big Data". more to: Chapter published in eBISS 2017

Invited Talk at the Big Data Workshop of the German Academic Scholarship Foundation

Thursday, the 28. June 2018

The German Academic Scholarship Foundation organized a 3 day workshop on relevant big data topics. more to: Invited Talk at the Big Data Workshop of the German Academic Scholarship Foundation

Paper accepted for SSDBM 2018

Monday, the 14. May 2018

Our paper on Metadata-Driven Error Detection was accepted for SSDB 2018. more to: Paper accepted for SSDBM 2018

Paper accepted for ICDE 2018

Tuesday, the 13. February 2018

Our paper on data discovery was accepted for ICDE 2018. more to: Paper accepted for ICDE 2018

Demo Paper accepted at ICDE

Sunday, the 24. December 2017

Our proposal to demonstrate the workflow generation of data civilizer was accepted at ICDE 2018. more to: Demo Paper accepted at ICDE

DFG Grant: Tractable Curation Workflows

Friday, the 03. November 2017

We are pleased to announce that the DFG is supporting our research ... more to: DFG Grant: Tractable Curation Workflows

Digital Science Match Paticipation

Monday, the 15. May 2017

Prof. Abedjan joined the science match with a presentation on data integration research. more to: Digital Science Match Paticipation

Invited Talk at HPI Symposium on Future Trends in SOC

Thursday, the 27. April 2017

Prof. Abedjan was invited to present his talk "Data Curation in the Wild: Limits and Challenges" at the annual HPI Symposium on Future Trends in Service-oriented Computing. more to: Invited Talk at HPI Symposium on Future Trends in SOC

Demo Paper accepted at SIGMOD 2017

Monday, the 27. February 2017

The Data Civilizer Demo in collaboration with MIT, QCRI, and University of Waterloo was accepted at SIGMOD 2017. more to: Demo Paper accepted at SIGMOD 2017

Tutorial accepted at SIGMOD 2017

Tuesday, the 21. February 2017

Our Tutorial on Data Profiling has been accepted for a 90 minute presentation at SIGMOD 2017. more to: Tutorial accepted at SIGMOD 2017

Onwrks receives Exist Funding

Tuesday, the 06. December 2016

We congratulate the founders of Onwrks, Anatoli Kantarovich, Nimrod Knoller und Michael Steimel for receiving the Exist starting grant. Onwrks is a Berlin-based software startup, specializing in digital tools for wind turbine data management. It is scientifically mentored by Prof. Ziawasch Abedjan more to: Onwrks receives Exist Funding

Amazon Tech Talk

Tuesday, the 15. November 2016

Prof. Abedjan was invited for a talk at Amazon Labs, Berlin. more to: Amazon Tech Talk

Filter Workshop Berlin

Monday, the 26. September 2016

Prof. Abedjan participated in the interdisciplinary "Filter" workshop. In his presentation with the title "NoFilter: Filtering, Transforming, and Cleaning Data", he described the role of filters in the data integration process. more to: Filter Workshop Berlin

Paper accepted for CIDR 2017.

Wednesday, the 12. October 2016

Our paper "The Data Civilizer System" was accepted for CIDR 2017. more to: Paper accepted for CIDR 2017.

Zusatzinformationen / Extras

Quick Access:

Schnellnavigation zur Seite über Nummerneingabe

Contact

Prof. Dr. Ziawasch Abedjan
Big Data Management
Faculty of EECS (IV)
Building EN 7
Room EN 723
Einsteinufer 17
10587 Berlin
+49 30 314 28007
+49 30 31421601

Mo-Fr