Difference between revisions of "Main Page"

From VistrailsWiki
Jump to navigation Jump to search
(Update to reflect the new VisTrailsJL)
 
(15 intermediate revisions by 6 users not shown)
Line 1: Line 1:
This is the official website for VisTrails, an open-source scientific workflow and provenance management system that supports data exploration and visualization.
= VisTrails =


VisTrails provides a general infrastructure that can be combined with existing systems and libraries. Official releases come with several packages, including VTK, matplotlib, and ImageMagick. The system is written in Python/Qt and runs on Mac, Unix and Windows.
'''VisTrails''' is an open-source scientific workflow and provenance management system developed at the [https://vida.engineering.nyu.edu/ VIDA Center] at New York University. It supports computational science by capturing and managing the complete history of the exploratory process: the workflows, their executions, and the results they produce.


[[#Downloading VisTrails|Read more >]]
VisTrails is actively developed again. The new version, '''[[VisTrailsJL]]''', is a complete reimplementation in [https://julialang.org/ Julia] that brings modern performance, notebook-based workflow authoring, and native compatibility with existing <code>.vt</code> files. See the [https://github.com/VIDA-NYU/VisTrailsJL GitHub repository] to get started.


== What's New ==


== Downloading VisTrails ==
After a hiatus since 2018, VisTrails is back. '''VisTrailsJL''' (v2.2) is a ground-up reimplementation in Julia that preserves everything that made the original system valuable — comprehensive provenance, visual workflow management, and support for real scientific use cases — while modernizing the foundation:
Current and past releases are available on our [[downloads|download page]].


== News ==
* '''Julia reimplementation''' — Julia's JIT compilation brings performance suitable for demanding scientific workflows, and its rich ecosystem (DataFrames.jl, DifferentialEquations.jl, Plots.jl) is a natural fit.
* [2015-07-10] VisTrails 2.2.2 has been released and is available on the [[Downloads]] page.
* '''Notebook-based workflow authoring''' — Workflows can now be defined directly in Jupyter notebooks using simple <code>#|</code> directives, with no GUI required.
* [2015-04-22] VisTrails 2.2 has been released and is available on the [[Downloads]] page.
* '''Full <code>.vt</code> compatibility''' — Existing workflows created with the Python version can be loaded, replayed, and visualized without modification.
* [2015-03-21] VisTrails 2.1.5 has been released and is available on the [[Downloads]] page.
* '''Git-native version control''' — Standard git replaces the custom versioning infrastructure for workflow history.
* VisTrails 2.1.4 has been released and is available on the [[Downloads]] page.
* '''Python interoperability''' — Existing Python modules and libraries remain accessible via PyCall.jl.
* VisTrails 2.0.3 has been released and is available on the [[Downloads]] page.
* To learn more about how to explore climate data using DV3D and ParaView in UV-CDAT, check out http://www.kitware.com/source/home/post/105
* Comparison of two forecasts of Hurricane Sandy using the DV3D slicer in UV-CDAT http://www.youtube.com/watch?v=BFN0RzN1hSE&feature=em-share_video_user
* Presentation on the use of VisTrails at NASA: VisTrails: Workflow Management System on NEX - https://c3.nasa.gov/nex/static/media/other/Vistrails-Votava.pdf
* VisTrails 2.0 has been released and it is available for download on our [[Downloads]] page.  VisTrails is now distributed under the BSD license and its GUI was completely re-designed to make it easier to manage and interact with multiple workflows. It also includes a much improved [http://www.vistrails.org/usersguide Users' Manual]. See the [http://sourceforge.net/projects/vistrails/files/vistrails/v2.0/README/view release notes] for a complete list of changes. Some notable features include:
** the ability to create mashups;
** support for the creation of reproducible papers;
** a provenance viewer;
** support for visual diffs as well as queries over workflows in different vistrails;
** a Control Flow Assistant to simplify the creation of workflows with loops and conditional constructs;
** support for global variables, so that modules deriving from Constant can be used to create variables that are shared among multiple workflows.  


[[Image:Screen Shot 2012-01-12 at 2.50.19 PM.png|200px|right]]
The original Python codebase (v2.2) is preserved in the repository for reference and compatibility testing.
* UV-CDAT 1.0.0 has been released. More detailed information and binaries are available at http://uvcdat.llnl.gov
** UV-CDAT was demoed at [http://ipaw2012.bren.ucsb.edu IPAW 2012]
** Video tutorials are available at http://uv-cdat.llnl.gov/wiki/VideoTutorials
** Binaries (Mac/RedHAT6/Ubuntu) or build-from-sources options can be found here: http://uv-cdat.llnl.gov/install/
** Please report any issues you encounter via bugzilla at: http://uv-cdat.llnl.gov/bugzilla/


; Quick links
: [https://github.com/VIDA-NYU/VisTrailsJL GitHub (VisTrailsJL)] &nbsp;|&nbsp; [[Documentation]] &nbsp;|&nbsp; [[Publications, Tutorials and Presentations]] &nbsp;|&nbsp; [[MailingLists|Mailing Lists]]


* [[Older News]]
== Core Features ==


== VisTrails Overview ==
=== Provenance and Workflow History ===
[[Image:Cosmology_example.png|480px|right]]
VisTrails is an open-source scientific workflow and provenance management system that provides support for simulations, data exploration and visualization. Whereas workflows have been  traditionally used to automate repetitive tasks, for applications that are exploratory in nature, such as simulations, data analysis and visualization, very little is repeated---change is the norm. As an engineer or scientist generates and evaluates  hypotheses about data under study, a series of different, albeit related, workflows are created while a workflow is adjusted in an interactive process. VisTrails was designed to manage these rapidly-evolving workflows.


A key distinguishing feature of VisTrails is a comprehensive provenance infrastructure that maintains detailed history information about the steps followed and data derived in the course of an exploratory task: VisTrails maintains provenance of data products, of the workflows that derive these products and their executions. This information is persisted as XML files or in a relational database, and it allows users to navigate workflow versions in an intuitive way, to undo changes but not lose any results, to visually compare different workflows and their results, and to examine the actions that led to a result. It also enables a series operations and user interfaces that simplify workflow design and use, including the ability to create and refine workflows by analogy and to query workflows by example.
A defining feature of VisTrails is its '''comprehensive provenance infrastructure'''. Unlike systems that track only the current state of a workflow, VisTrails maintains the full history of every step taken during an exploratory analysis — what was tried, what was changed, and what results each version produced. This enables users to:


VisTrails supports the creation and execution of workflows. It allows the combination of loosely-coupled resources, specialized libraries, grid and Web services. The released version comes with support for several packages including, [http://www.vtk.org/ VTK], [http://www.imagemagick.org/script/index.php Image Magick], Web Services, and [http://www.scipy.org/PyLab pylab]. You can also download [[UserContributedPackages|packages contributed by our users]], as well as [http://www.vistrails.org/usersguide/v2.1/html/packages.html create your own packages/modules]. There are also a number of [[Projects using VisTrails|projects extending or building on top of VisTrails]].
* Navigate and compare workflow versions in an intuitive tree interface
Workflows can be run interactively, through the VisTrails GUI, or in batch using a VisTrails server.
* Undo changes without losing intermediate results
* Visually diff two workflows and their outputs side by side
* Reproduce any prior result exactly, long after it was first computed


VisTrails 1.0 was released in October 2007. Since then, the system has been downloaded thousands of times. VisTrails is written in Python and it uses the multi-platform Qt library for its user interface. It runs on Mac, Linux and Windows. You can download VisTrails from http://www.vistrails.org/index.php/Downloads
Provenance information is stored as XML or in a relational database (Python version), or managed via standard git (Julia version).
 
=== Building and Running Workflows ===
 
VisTrails supports workflows expressed as '''dataflows''', with support for functional loops and conditional branching. Workflows can be run interactively through the GUI or in batch mode via a server. The system is designed to connect loosely coupled resources — specialized libraries, web services, and grid computing infrastructure.
 
In VisTrailsJL, workflows can also be defined declaratively in Jupyter notebooks:
 
<pre>
#| workflow: my_analysis
 
#| module-id: input
#| module-type: basic:Integer
#| params:
#|  - value: 42
 
#| module-id: process
#| module-type: mypackage:Transform
#| inputs:
#|  - value: input.value
 
#| execute
</pre>
 
Packages and modules are easy to add. The <code>JuliaSource</code> and <code>PythonSource</code> module types allow custom code to be embedded directly in a workflow without creating a full package.


=== Publishing Reproducible Results ===
=== Publishing Reproducible Results ===
VisTrails 2.0 allows the inclusion of reproducible results in LaTeX/PDF documents. We provide a LaTeX package that allows users to add links to their results in the LaTeX source. For example:


<code>
VisTrails 2.0 introduced support for embedding reproducible results directly in LaTeX/PDF documents via a companion LaTeX package. A figure in a compiled PDF becomes active: clicking it invokes VisTrails and re-executes the workflow that produced it on any machine with the software installed.
 
<pre>
\usepackage{vistrails}
\usepackage{vistrails}
...


\begin{figure}
\begin{figure}
\begin{center}
\begin{center}
\subfigure[a=0.9]{\vistrail[filename=alps.vt, version=2, pdf]{width=8cm}}
\caption{Clicking this figure retrieves and re-runs the workflow that produced it.}
\end{center}
\end{figure}
</pre>


\subfigure[a=0.9]{\vistrail[filename=alps.vt, version=2, pdf]{width=8cm}}
=== Querying and Refining Workflows ===


\subfigure[a=0.9]{\vistrail[filename=alps.vt, version=11, pdf,buildalways]{width=8cm}}
Users can construct expressive queries over a collection of workflows using the same interface used to build them. An '''analogy mechanism''' allows complex modifications to be applied to one workflow by example from another, without manually editing workflow specifications — useful when a family of related analyses needs to evolve together.


\caption{A figure produced by an ALPS VisTrails workflow. Clicking the figure retrieves the workflow used to
=== Visualizing and Comparing Results ===
create it. Opening that workflow on a machine with VisTrails and ALPS installed lets the reader execute the full calculation.}


\end{center}
VisTrails provides a '''spreadsheet view''' for comparing the results of multiple workflows or multiple parameterizations of the same workflow side by side. The visual diff interface highlights structural differences between two workflow versions. Workflows and their version trees can be rendered as SVG (VisTrailsJL) or displayed on large-format display walls.


\end{figure}
== Getting Started ==
</code>


Once the LaTeX document is compiled, the figure in the PDF becomes active, and when clicked, it will invoke VisTrails and reproduce the result.
=== VisTrailsJL (Julia — current) ===
You can also upload your results to [http://www.crowdlabs.org CrowdLabs] and export them to Web sites or Wikis, where users can interact with them through a Web browser. See e.g., http://www.crowdlabs.org/vistrails/medleys/details/26/


=== Building Workflows ===
<pre>
VisTrails supports workflows expressed as dataflows. It can also support functional loops and conditional statements (new in VisTrails 1.3!).  In addition, VisTrails is easily extensible; users can quickly create new modules or wrap existing code or libraries for use as modules in VisTrails.  Also, the <code>PythonSource</code> module can be used to write one-time translations or test code in a workflow.  See  the [http://sourceforge.net/projects/vistrails/files/vistrails/vistrails-usersguide-1.3-rev198.pdf/download Users' Guide] for more details on all of these features.
# Clone the repository
git clone https://github.com/VIDA-NYU/VisTrailsJL.git
cd VisTrailsJL/julia


=== Combining VisTrails Provenance with other Tools ===
# Install dependencies
[[Image:DisplayWall_ipod_screenshot.png|180px|right]]
julia --project=. -e 'using Pkg; Pkg.instantiate()'
The VisTrails provenance infrastructure can also be combined with interactive tools. We are developing a series of provenance plugins for different tools, including [https://wci.llnl.gov/codes/visit VisIt] and [http://www.paraview.org/ ParaView]. The [http://www.vistrails.org/index.php/ParaView_Plugin open-source ParaView plugin has been released].


=== Visualizing and Comparing Workflow Results ===
# Load and render an existing workflow
VisTrails provides a ''spreadsheet'' where users can compare the results of multiple workflows, or multiple workflow runs. Besides using a desktop, you can display the results of VisTrails workflows on a display wall or on an iPod. Users can also compare different workflows using the visual difference interface.
julia --project=. -e '
using VisTrailsJL
vt = load_vistrail("../examples/gcd.vt")
workflow = get_pipeline(vt)
render_pipeline_svg(workflow, "workflow.svg")
'
</pre>


=== Querying and Refining Workflows by Example ===
See the [https://github.com/VIDA-NYU/VisTrailsJL/blob/v2.2/julia/QUICKSTART.md Quickstart Guide] for a full walkthrough.
Users can quickly construct expressive queries over a collection of workflows by example, using the same familiar interface they use to build workflows. VisTrails also provides an analogy mechanism whereby users can perform complex modifications to workflows by analogy, without having to directly modify the workflow specifications.


==[[Documentation|System Documentation]]==
=== Python VisTrails (legacy reference) ===
If you want to get more information about VisTrails you can consult the Users' Manual or the FAQ (see links below). If you have questions that are not answered in these resources, you can post it to the users' mailing list: http://www.vistrails.org/index.php/MailingLists


To report bugs, please use our bug tracker: https://github.com/VisTrails/VisTrails/issues
The original Python version (v2.2, requires Python 2 / PyQt4) is preserved in the repository for reference and for loading existing <code>.vt</code> files in legacy environments.


=== [[Documentation#Screen Shots|Screen Shots]]===
<pre>
=== [[Users_Guide|User's Guide]]===
# GUI mode
A Web version of the Users Guide is currently under development and can be accessed at http://www.vistrails.org/usersguide.
python vistrails/run.py
=== [[FAQ]]===


==[[MailingLists|Mailing Lists]]==
# Batch mode
==[[Publications, Tutorials and Presentations]]==
python vistrails/run.py --batch [options]
==[[People]]==
</pre>


== [http://www.vistrails.org/index.php/Projects_using_VisTrails Some Projects using VisTrails] ==
== Projects Using VisTrails ==


VisTrails has supported real scientific workflows across a wide range of domains. The following projects reflect the breadth of communities that have relied on the system.


{| class="wikitable"
{| class="wikitable"
|-
|-
! NASA Climate Data Analysis  
! USGS Habitat Modeling
! NASA Climate Data Analysis
! DOE CDAT
! DOE CDAT
! ALPS Simulations
|-
|-
| [[Image:usgs.png|200px|left]]
| [[Image:nasa.png|200px|left]]
| [[Image:nasa.png|200px|left]]
| [[Image:cdat.png|200px|left]]
| [[Image:cdat.png|200px|left]]
| [[Image:alps-shot.png|200px|left]]
|
|}
|}


{| class="wikitable"
{| class="wikitable"
|-
|-
! ALPS Simulations
! NSF STC CMOP
! NSF STC CMOP
! NSF CDI Wildfire
! NSF CDI Wildfire
! NSF DataONE-EVA
|-
|-
| [[Image:alps-shot.png|200px|left]]
| [[Image:cmop-ss.png|200px|left]]
| [[Image:cmop-ss.png|200px|left]]
| [[Image:wildfire.png|200px|center]]
| [[Image:wildfire.png|200px|center]]
|}
{| class="wikitable"
|-
! NSF DataONE-EVA
|-
| [[Image:eva.png|200px|left]]
| [[Image:eva.png|200px|left]]
|
|}
|}
[http://www.vistrails.org/index.php/Projects_using_VisTrails See other projects]


== [[Vistrails and Teaching]] ==
[https://vistrails.org/index.php/Projects_using_VisTrails See all projects using VisTrails]
 
== VisTrails in Teaching ==
 
VisTrails has been used as a teaching tool in courses on Scientific Visualization and Digital Media. Its provenance infrastructure makes it particularly effective in educational settings, where capturing and comparing student workflows provides rich feedback for instructors and learners alike.
 
Our [http://www.cs.utah.edu/~juliana/pub/vistrails-teaching-eurographics2010.pdf paper] describing a provenance-rich teaching methodology received the '''Best Paper Award''' at Eurographics 2010 Education.
 
[[Vistrails and Teaching|More on VisTrails and Teaching]]
 
== System Documentation ==
 
* [[Documentation|Documentation overview]]
* [https://github.com/VIDA-NYU/VisTrailsJL/blob/v2.2/julia/README.md VisTrailsJL README]
* [https://github.com/VIDA-NYU/VisTrailsJL/blob/v2.2/julia/QUICKSTART.md Quickstart Guide]
* [https://github.com/VIDA-NYU/VisTrailsJL/blob/v2.2/julia/docs/IMPLEMENTATION_STATUS.md Implementation Status]
* [[FAQ]]
* [[Users_Guide|Python User's Guide (legacy)]]
 
To report bugs or request features, please use the [https://github.com/VIDA-NYU/VisTrailsJL/issues issue tracker].
 
For questions not covered by the documentation, post to the [https://vistrails.org/index.php/MailingLists mailing list].
 
== Citing VisTrails ==
 
If you use VisTrails or VisTrailsJL in your research, please cite the relevant work:


The VisTrails system has been used a tool to teach different courses, including Scientific Visualization and Digital Media.
'''Original VisTrails system:'''
For more information on the educational uses of VisTrails see [http://www.vistrails.org/index.php/Vistrails_and_Teaching VisTrails and Teaching]. Our [http://www.cs.utah.edu/~juliana/pub/vistrails-teaching-eurographics2010.pdf paper] describing a provenance-rich teaching methodology received the Best Paper Award at the Eurographics 2010 Education program.
<pre>
@inproceedings{vistrails2006,
  title    = {VisTrails: visualization meets data management},
  author    = {Callahan, Steven P and Freire, Juliana and Scheidegger,
              Carlos E and Silva, Cl{\'a}udio T and Vo, Huy T},
  booktitle = {Proceedings of the 2006 ACM SIGMOD International Conference
              on Management of Data},
  pages    = {745--747},
  year      = {2006},
  doi      = {10.1145/1142473.1142574}
}
</pre>
 
'''VisTrailsJL (Julia reimplementation):'''
<pre>
@software{vistrailsjl2025,
  title  = {VisTrailsJL: A Julia Implementation of VisTrails},
  author = {Silva, Claudio T},
  year  = {2025},
  url    = {https://github.com/VIDA-NYU/VisTrailsJL}
}
</pre>
 
[[Publications, Tutorials and Presentations|Full publication list]]
 
== People ==
[[People]]


== Sponsors ==
== Sponsors ==


This work has been partially supported by the National Science Foundation under grants  
This work has been supported in part by the National Science Foundation under grants
[http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=0905385 IIS-0905385],
[http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=0905385 IIS-0905385],
[http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=0844572 IIS-0844572],
[http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=0844572 IIS-0844572],
[http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=0746500 IIS CAREER-0746500],
[http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=0746500 IIS CAREER-0746500],
[http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=0751152 CNS-0751152],
[http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=0751152 CNS-0751152],
[http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=0513692 IIS-0513692],  
[http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=0513692 IIS-0513692],
[http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=0401498 CCF-0401498],  
[http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=0401498 CCF-0401498],
[http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=0541560 CNS-0541560], [http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=0405402 OISE-0405402],
and others; by the Department of Energy under the SciDAC program (SDM, VACET, and UV-CDAT);
[http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=0424602 OCE-0424602],
and by IBM Faculty Awards (2005–2008) and a University of Utah Seed Grant.
[http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=0524096 CNS-0524096],
[http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=0534628 IIS-0534628],
the Department of Energy under the SciDAC program
([http://www.scidac.org/compsci/SDM.html SDM]
and
[http://www.scidac.org/viz/VACET.html VACET]) and UV-CDAT,
IBM Faculty Awards (2005, 2006, 2007, and 2008) and a University of Utah Seed Grant.
 
== Related Sites ==
 
[[BirdVis]]
 
[http://www.crowdlabs.org CrowdLabs]
 
[[RepeatabilityCentral]]


[[ProvenanceAnalytics]]
== Related ==


[[BirdVis]] &nbsp;|&nbsp;
[http://www.crowdlabs.org CrowdLabs] &nbsp;|&nbsp;
[[RepeatabilityCentral]] &nbsp;|&nbsp;
[[ProvenanceAnalytics]] &nbsp;|&nbsp;
[[Provenance: potpourri]]
[[Provenance: potpourri]]

Latest revision as of 22:15, 23 April 2026

VisTrails

VisTrails is an open-source scientific workflow and provenance management system developed at the VIDA Center at New York University. It supports computational science by capturing and managing the complete history of the exploratory process: the workflows, their executions, and the results they produce.

VisTrails is actively developed again. The new version, VisTrailsJL, is a complete reimplementation in Julia that brings modern performance, notebook-based workflow authoring, and native compatibility with existing .vt files. See the GitHub repository to get started.

What's New

After a hiatus since 2018, VisTrails is back. VisTrailsJL (v2.2) is a ground-up reimplementation in Julia that preserves everything that made the original system valuable — comprehensive provenance, visual workflow management, and support for real scientific use cases — while modernizing the foundation:

  • Julia reimplementation — Julia's JIT compilation brings performance suitable for demanding scientific workflows, and its rich ecosystem (DataFrames.jl, DifferentialEquations.jl, Plots.jl) is a natural fit.
  • Notebook-based workflow authoring — Workflows can now be defined directly in Jupyter notebooks using simple #| directives, with no GUI required.
  • Full .vt compatibility — Existing workflows created with the Python version can be loaded, replayed, and visualized without modification.
  • Git-native version control — Standard git replaces the custom versioning infrastructure for workflow history.
  • Python interoperability — Existing Python modules and libraries remain accessible via PyCall.jl.

The original Python codebase (v2.2) is preserved in the repository for reference and compatibility testing.

Quick links
GitHub (VisTrailsJL)  |  Documentation  |  Publications, Tutorials and Presentations  |  Mailing Lists

Core Features

Provenance and Workflow History

A defining feature of VisTrails is its comprehensive provenance infrastructure. Unlike systems that track only the current state of a workflow, VisTrails maintains the full history of every step taken during an exploratory analysis — what was tried, what was changed, and what results each version produced. This enables users to:

  • Navigate and compare workflow versions in an intuitive tree interface
  • Undo changes without losing intermediate results
  • Visually diff two workflows and their outputs side by side
  • Reproduce any prior result exactly, long after it was first computed

Provenance information is stored as XML or in a relational database (Python version), or managed via standard git (Julia version).

Building and Running Workflows

VisTrails supports workflows expressed as dataflows, with support for functional loops and conditional branching. Workflows can be run interactively through the GUI or in batch mode via a server. The system is designed to connect loosely coupled resources — specialized libraries, web services, and grid computing infrastructure.

In VisTrailsJL, workflows can also be defined declaratively in Jupyter notebooks:

#| workflow: my_analysis

#| module-id: input
#| module-type: basic:Integer
#| params:
#|   - value: 42

#| module-id: process
#| module-type: mypackage:Transform
#| inputs:
#|   - value: input.value

#| execute

Packages and modules are easy to add. The JuliaSource and PythonSource module types allow custom code to be embedded directly in a workflow without creating a full package.

Publishing Reproducible Results

VisTrails 2.0 introduced support for embedding reproducible results directly in LaTeX/PDF documents via a companion LaTeX package. A figure in a compiled PDF becomes active: clicking it invokes VisTrails and re-executes the workflow that produced it on any machine with the software installed.

\usepackage{vistrails}

\begin{figure}
\begin{center}
\subfigure[a=0.9]{\vistrail[filename=alps.vt, version=2, pdf]{width=8cm}}
\caption{Clicking this figure retrieves and re-runs the workflow that produced it.}
\end{center}
\end{figure}

Querying and Refining Workflows

Users can construct expressive queries over a collection of workflows using the same interface used to build them. An analogy mechanism allows complex modifications to be applied to one workflow by example from another, without manually editing workflow specifications — useful when a family of related analyses needs to evolve together.

Visualizing and Comparing Results

VisTrails provides a spreadsheet view for comparing the results of multiple workflows or multiple parameterizations of the same workflow side by side. The visual diff interface highlights structural differences between two workflow versions. Workflows and their version trees can be rendered as SVG (VisTrailsJL) or displayed on large-format display walls.

Getting Started

VisTrailsJL (Julia — current)

# Clone the repository
git clone https://github.com/VIDA-NYU/VisTrailsJL.git
cd VisTrailsJL/julia

# Install dependencies
julia --project=. -e 'using Pkg; Pkg.instantiate()'

# Load and render an existing workflow
julia --project=. -e '
using VisTrailsJL
vt = load_vistrail("../examples/gcd.vt")
workflow = get_pipeline(vt)
render_pipeline_svg(workflow, "workflow.svg")
'

See the Quickstart Guide for a full walkthrough.

Python VisTrails (legacy reference)

The original Python version (v2.2, requires Python 2 / PyQt4) is preserved in the repository for reference and for loading existing .vt files in legacy environments.

# GUI mode
python vistrails/run.py

# Batch mode
python vistrails/run.py --batch [options]

Projects Using VisTrails

VisTrails has supported real scientific workflows across a wide range of domains. The following projects reflect the breadth of communities that have relied on the system.

USGS Habitat Modeling NASA Climate Data Analysis DOE CDAT
Usgs.png
Nasa.png
Cdat.png
ALPS Simulations NSF STC CMOP NSF CDI Wildfire
Alps-shot.png
Cmop-ss.png
Wildfire.png
NSF DataONE-EVA
Eva.png

See all projects using VisTrails

VisTrails in Teaching

VisTrails has been used as a teaching tool in courses on Scientific Visualization and Digital Media. Its provenance infrastructure makes it particularly effective in educational settings, where capturing and comparing student workflows provides rich feedback for instructors and learners alike.

Our paper describing a provenance-rich teaching methodology received the Best Paper Award at Eurographics 2010 Education.

More on VisTrails and Teaching

System Documentation

To report bugs or request features, please use the issue tracker.

For questions not covered by the documentation, post to the mailing list.

Citing VisTrails

If you use VisTrails or VisTrailsJL in your research, please cite the relevant work:

Original VisTrails system:

@inproceedings{vistrails2006,
  title     = {VisTrails: visualization meets data management},
  author    = {Callahan, Steven P and Freire, Juliana and Scheidegger,
               Carlos E and Silva, Cl{\'a}udio T and Vo, Huy T},
  booktitle = {Proceedings of the 2006 ACM SIGMOD International Conference
               on Management of Data},
  pages     = {745--747},
  year      = {2006},
  doi       = {10.1145/1142473.1142574}
}

VisTrailsJL (Julia reimplementation):

@software{vistrailsjl2025,
  title  = {VisTrailsJL: A Julia Implementation of VisTrails},
  author = {Silva, Claudio T},
  year   = {2025},
  url    = {https://github.com/VIDA-NYU/VisTrailsJL}
}

Full publication list

People

People

Sponsors

This work has been supported in part by the National Science Foundation under grants IIS-0905385, IIS-0844572, IIS CAREER-0746500, CNS-0751152, IIS-0513692, CCF-0401498, and others; by the Department of Energy under the SciDAC program (SDM, VACET, and UV-CDAT); and by IBM Faculty Awards (2005–2008) and a University of Utah Seed Grant.

Related

BirdVis  |  CrowdLabs  |  RepeatabilityCentral  |  ProvenanceAnalytics  |  Provenance: potpourri