SyntheticDataGenerator tax options

Options for generating a taxonomy can be displayed by typing

SyntheticDataGenerator tax

Available Options
Name          Value     Description                             Default value
-fname        filename  Output base file name                   No default
-tlen         double    Average transaction length              10
-nitems       integer   Item count                              100000
-randseed     integer   Master random seed (must be <= 0)       0*
-lit.npats    integer   Large item set pattern count            10000
-lit.patlen   double    Large item set average pattern length   4
-lit.corr     double    Large item set correlation (-corr)      0.25
-lit.conf     double    Large item set confidence (-conf)       0.75
-nroots       integer   Number of roots                         250
-nlevels      double    Number of levels                        0
-fanout       double    Fan out degree                          5
-depth        double    Taxonomy depth ratio                    1

* A randseed of zero results in a random seed being automatically generated

The taxonomy generator produces four files:

filename.config         The parameters used to generate the taxonomy
filename.taxonomy       The taxonomy
filename.patterns       The large item set patterns
filename.transactions   The transactions

At a minimum, an output file name must be specified, e.g.

SyntheticDataGenerator tax -fname taxonomy
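
Defaults can be overridden by appending further options from the table above. The following is a minimal sketch rather than a documented workflow: the helper expected_outputs is hypothetical, the override values are purely illustrative, and it assumes the output files are written to the current working directory under the base name given to -fname.

```shell
#!/bin/sh
# Hypothetical helper (not part of the tool): print the four file names
# that a run with base name $1 is expected to produce.
expected_outputs() {
  for ext in config taxonomy patterns transactions; do
    printf '%s.%s\n' "$1" "$ext"
  done
}

base=taxonomy

# Run the generator only if it is on PATH. The values for -nroots,
# -fanout and -tlen are illustrative overrides of the defaults.
if command -v SyntheticDataGenerator >/dev/null 2>&1; then
  SyntheticDataGenerator tax -fname "$base" -nroots 100 -fanout 3 -tlen 8
fi

# Report which expected output files exist (assumes they are written
# to the current working directory).
for f in $(expected_outputs "$base"); do
  if [ -f "$f" ]; then
    echo "found:   $f"
  else
    echo "missing: $f"
  fi
done
```

Any option left off the command line keeps the default value listed in the table.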

Last edited Feb 9, 2011 at 10:16 AM by arthur_pitman, version 10

