PCDC - On the Highway to Data

A Tool for the Fast Generation of Large Synthetic Data Sets

Parallel Coordinates for Data Creation (PCDC) is a new visual-interactive tool for the fast generation of synthetic labeled multidimensional data sets. Multivariate data need to be analyzed in various domains such as finance, biology or medicine using complex data mining techniques. For the evaluation or presentation of the techniques, e.g., for assessing their sensitivity to specific data properties, test data need to be generated.
PCDC allows for a fast and intuitive creation of multivariate data with several classes. It is based on interactive definition of data regions and data distributions in a parallel coordinates view. It offers a quick definition of data regions over several dimensions in one interface. Moreover, the users can directly see the outcome of their settings in the same view without the need for switching between data generation and output visualization. Our tool enables also an easy adjustment of the data generation parameters for creating additional similar datasets.

Video: PCDC, example usecase
Program Files: PCDC v1.1
Manual: Manual.pdf
Contact: Sebastian Bremm, Tatiana von Landesberger

Changelog v1.1 (29.01.2013):
- Variable re-sampling rate to generate datasets with a specific similarity
- Random order of line rendering
- Optional transparency in line rendering
- CSV export option
- Variable value range of axis
- Minor bugfixes and improvements

Related Publication:

Bremm, S. , von Landesberger, T., Heß, M., Fellner, D.W.
PCDC - On the Highway to Data - A Tool for the Fast Generation of Large Synthetic Data Sets
International Workshop on Visual Analytics 2012
bib

Contact

Technische Universität Darmstadt

Interactive Graphics Systems Group

Fraunhoferstr. 5
64283 Darmstadt

Tel. +49 6151 155 679

icon email office@gris.tu-

A A A | Print Print | Legal note Legal Notes | Contact Contact
to topto top