You do not have permission to edit this page, for the following reason:
The action you have requested is limited to users in the group: Users.
Project description (free text)
Give a concise project description. Include:
With the rapid development and growth of interconnected devices, more physical systems are integrated with computer-based systems, e.g. sensors and actuators can be sensed and controlled remotely across the network. It is enticing to analyze on-board sensor data streaming from devices (e.g. vehicle speed, engine torque etc.), in order to discover interesting patterns and knowledge. We have collected such data from Volvo buses in normal operation. It is interesting to analyze this data from the usage point of view, in order to discover and categorize various vehicle operations in an unsupervised way (using clustering). Clustering is the task of grouping data in such a way that objects in the same group (i.e. cluster) are more similar to each other than to those in other groups (i.e. other clusters). Typical clustering algorithms output a single clustering (i.e. grouping) of the data. However, in real world applications (such as vehicle operation analysis), data can be interpreted in many different ways, leading to different groupings that are reasonable and interesting from different perspectives. The goal of the thesis is to propose a method that allows to discover multiple clustering solutions, compare them, and find out if there is a single best (consensus) clustering, or multiple consistent clustering solutions. In the latter case, each data object would be grouped in multiple clusters, representing different perspectives on the data. (i.e. orthogonal, or independent clusterings). Clustering solutions that differ in a significant but consistent way can be obtained by constructing different views of the data, for example: - Using different combinations of feature may reveal different structures of the data. - Using different similarity/distance measures. - Various data sources (different sources of the same data). - Varying the hyperparameters of the clustering algorithm. - Combining various clustering algorithms, etc. While the main application focuses on grouping vehicle operations, the proposed method could be general and applicable for any data with such orthogonal clusters. References: check the of references above. Contact: - Mohamed-Rafik Bouguelia ( mohbou@hh.se ) - Slawomir Nowaczyk ( slawomir.nowaczyk@hh.se )
Summary:
This is a minor edit Watch this page
Cancel
Home
Research
Education
Partners
People
Contact