Note
Siren allows you to interactively mine and visualize redescriptions from your data.
We outline here some high-level interactions offered by Siren.
Because examples are worth many words, here are some datasets to try your hands on.
After you get Siren installed and running, it will open the tools window, with tabs for variables and redescriptions, all of them empty. Hence, the first thing to do is to import data or open an existing siren package to start working.
If you already have some siren package (i.e. with a .siren
extension) you can open it via the interface menu .
Otherwise, or if you want to work on a new data set, you can import data to Siren, this will populate the Entities
and Variables
tabs.
Once the two sets of variables are loaded and can be seen in the tabs you may want to let the tool mine redescriptions in an fully automated way, using currently enabled variables. The menu entry
redescriptions let you do just that.Alternatively, if both queries in a visualization are empty Siren simply mines redescriptions on your data when clicking on the Expand
button.
Results of the mining process will appear in a new list in the Redescriptions
tab.
Before running a mining task make sure you have adjusted the main mining parameters…
Individual redescriptions and lists can be plotted in different visualizations, depending on the type of your data and the redescriptions.
Double-click on an item (variable, redescription, list) to open the default visualization for that item. The
entry in the main menu and under the contextual menu list available visualizations.You can also press the expansion button to automatically find expansions of any redescription your are currently visualizing and editing. Siren will try to append literals to the current redescription, using the enabled variables.
Again, results will be appended to the redescriptions list in the Expansions
tab.
Before running an expansion task make sure you have adjusted the mining parameters…
A list of redescription can be filtered automatically. That is, the algorithm will go through the redescriptions, from top to bottom in the current order, an check for each redescription, whether it is redundant given the previous ones. If a redescription is found redundant it will be disabled. This is done via the different filter entries in the
menu .The lists of redescriptions from the Redescriptions tab can be exported under different formats from the contextual menu.
You can save your current project, i.e. the data, current redescription lists and parameter preferences as a siren package (i.e. with a .siren
extension) via the interface menu …
If you continue working on the current project you can save changes to the current siren package via the interface menu
.Existing siren packages can be opened via the interface menu
.Lists of redescriptions can be added and deleted from a package. A package needs to be first created before redescriptions lists can be added to it.
The data analysis process typically require a few iterations of mining, looking at the results, tuning the parameters, etc.