DAVI Interaction - redo

Posted Dec 14, 2024 Updated Dec 19, 2024

By Wei Xiong

6 min read

Interaction in Visualization

Why Interaction?

Interaction transforms static diagrams into tools for data exploration.
Allows users to:
- Adjust the visualization: Zoom, pan, filter, and query the data.
- Issue visual queries: Select interesting clusters, see linked data across multiple views.
- Enhance the visualization: Tooltips, brushing and linking, and interactivity for better data understanding.
- Support diverse users and tasks: Interaction helps cater to different goals and user skill levels.

Visual Information Seeking Mantra (Shneiderman)

Mantra: “Overview first, zoom and filter, then details on demand.”
- Start with a broad overview of the entire dataset.
- Zoom in or filter to focus on interesting subsets.
- Retrieve detailed information (e.g., tooltips, labels) as needed.

Overview & Detail vs. Focus & Context

Overview and Detail (Separated):
- Two basic ways:
  1. Spatial Separation: Example: A main view plus a smaller “minimap.”
    - The overview might be a small thumbnail, the detail a large main panel.
  2. Temporal Separation (Zooming): The user zooms into a region. You only see one level at a time.
    - Panning moves your view at the same scale.
    - Zooming changes scale, showing more or less detail.
Focus and Context (Combined):
- Integrate both detailed focus and the broader context in a single continuous view.
- Often achieved through “fisheye” or “lenses” that enlarge a focus region while compressing context around it.
- Examples:
  - Fisheye Tree Views: Large font for the selected node, diminishing sizes for nodes farther away.
  - Table Lens: Compresses rows/columns not in focus into 1-pixel lines showing only color-encoded values.
  - Interactive Lenses: Over a map or graph, a lens that shows detail or computes something (like density or sampling) in the lens region.

Zooming & Panning:
- Represented by “space-scale diagrams,” which show how a “viewing window” moves and changes scale.
- Smooth Navigation:
  - Combines zooming and panning so the user never loses context (origin or destination remain visible).
  - Example: Navigating large graphs by zooming out, showing origin and destination, then zooming back in at the target.

Navigational Cues

For Panning: Radar views that show off-screen elements at the edges, guiding where to pan.
For Zooming:
- Computed “heterogeneity” or “differences” at coarser scale to highlight where something interesting might appear if you zoom in.
- Example: Using color bands (heterogeneity bands) to show potential hidden patterns in a line chart.

Multiple Coordinated Views (MCV)

When to use MCV:
- Rule of Diversity: Many attributes, tasks, or user groups.
- Rule of Complementarity: Different views show different aspects, providing a richer understanding.
- Rule of Decomposition: Break complex data into simpler sub-views.
- Rule of Parsimony: Don’t add views if one suffices.
- Rule of Space-Time Resource Optimization: Save effort and adapt to multiple tasks/users in one environment.
- Rule of Self-Evidence: Make relationships between views clear (e.g., highlight linked items in all views).
- Rule of Consistency: Keep interaction and encoding consistent across views.
- Rule of Attention Management: Use highlighting, filtering, etc. to guide user’s focus.
Brushing and Linking:
- Selecting (brushing) data items in one view highlights the same items in other views.
- Supports comparisons across multiple dimensions or attributes.

Tangible Interaction and Software Architecture

(This part was mentioned to be handled next Monday, so only a brief note)
Tangible Interaction:
- Physical interaction devices or tokens that directly manipulate the visualization.
Software Architecture:
- Interaction often implemented by layering additional pipelines (e.g., lens pipelines parallel to main visualization pipelines).

Visualization Critiques and Examples Given

General Steps in Critiques:

Identify chart type and data being represented.
Consider if the visual encoding matches the data’s nature (expressiveness).
Check perceptual aspects: Is it easy to read? Are encodings effective? Is it colorblind safe?
Consider efficiency: Are there simpler ways to convey the same information? Is it overly complicated?

Critique Example 1: A Poor Chart of Republican Primaries (2016)

Shown as a weird column chart with days of the month encoded as bars, and states vs. primaries over time.
Problems:
- Day of the month is treated like a quantitative variable (bars) while it’s actually an ID-like attribute. Inappropriate chart choice.
- Months encoded with different hues, even though months are ordinal (should use ordered channel).
- Hard to interpret tasks: The chart lacks a sensible purpose or meaningful comparison.
- Better Approaches:
  - Use a calendar-based layout, a timeline, or well-structured bar/line charts where time is on a continuous axis.
  - Example from Washington Post: Stacked bars or timelines more clearly showing when certain states hold their primaries.

Critique Example 2: History of Pandemics Visualization

Complex bubble timeline with 3D perspective.
Each pandemic is a 3D fuzzy ball placed along a timeline, sized by death toll.
Problems:
- Perspective distortion introduces a huge lie factor. Bubbles farther away look smaller than they should.
- Comparing past pandemics to modern ones is distorted by both scale and population changes over time.
- Color usage is unclear. Uncertain what color encodes.
- Some dates not aligned properly (some events placed incorrectly in time).
Better Approaches:
- Normalizing death toll by world population.
- Using simple 2D charts: line graphs, bars, or properly scaled bubbles on a single plane.
- Annotated timelines or interactive visualizations allowing scaling and filtering.

Closing Notes

This lecture wraps up basic static visualization concepts and transitions into interactive visualization techniques.
Next sessions will look at tangible interaction and software architecture, and then delve into design exercises.
Students are encouraged to review examples, practice critiques, and be prepared for the upcoming hands-on design activities.

Interactive Quiz on Chart Types and Principles

After the project discussion, the professor gave a quiz to reinforce visualization concepts:

Chart Identification & Variables:
- Example Question: Given a certain chart, identify whether it is a bar chart, column chart, grouped column chart, or histogram.
- Key Point: A histogram visualizes the distribution of a quantitative variable divided into bins.
  - The bins represent continuous ranges of values, not categories.
  - Histograms ideally have no gaps between bars.
- Common Mistake to Note: If the x-axis represents categories (like “Age Groups”) but the axis values are actually continuous numeric ranges, that indicates a histogram. If there are gaps or the bins are not equal, it’s a flawed histogram but still conceptually a histogram.
Banking to 45 Degrees (Line Charts):
- Key Point: Line charts are best interpreted when slopes are around 45°. This principle, known as “banking to 45°”, comes from Cleveland & McGill’s perceptual studies.
- Other listed “principles” like tilting to 35°, sloping to 25°, or binning to 55° are not real. The correct principle is “banking to 45°.”
Stacked Area Charts vs. Other Charts:
- Key Point: Stacked area charts require data that sum up to a meaningful whole.
- For example, if each layer represents a department’s sales, stacking them gives the company’s total sales over time.
- Histograms, grouped bar charts, or grouped area charts (the latter doesn’t really exist as a standard term) do not require data that sum to a whole.
Connected Scatter Plot:
- A connected scatter plot takes regular scatter plot points (each with x and y values) and connects them in the order of observation (often time).
- Distinguishing features:
  - Shows a trajectory or path through time or another sequence.
  - Different from a line chart because both x and y can represent different quantitative attributes (not necessarily time on the x-axis).
Overcrowding and Overplotting in Scatterplots:
- Key Point: For overplotting (points placed exactly on top of each other), use transparency or jittering.
- For overcrowding (too many points making a “cloud”), use binning (hexbin plots, 2D histograms) or density maps (heatmaps) to show data distribution more clearly.

Quiz Summary:

Histograms = continuous numeric bins.
Banking line charts to 45° = optimal readability.
Stacked area charts = data summation must be meaningful.
Connected scatter plot = connect points in observed order.
Overplotting vs. overcrowding = different techniques to handle them.

Data Visualization, Redo

This post is licensed under CC BY 4.0 by the author.

DAVI Interaction - redo

Interaction in Visualization

Why Interaction?

Visual Information Seeking Mantra (Shneiderman)

Overview & Detail vs. Focus & Context

Navigation & Space-Scale Diagrams

Navigational Cues

Multiple Coordinated Views (MCV)

Tangible Interaction and Software Architecture

Visualization Critiques and Examples Given

Closing Notes

Interactive Quiz on Chart Types and Principles

Trending Tags