DAVI Visualization Design redo

Posted Dec 10, 2024 Updated Dec 19, 2024

By Wei Xiong

8 min read

Topic: Transitioning from Interaction Techniques to Data Visualization Design Processes, Data & Task Abstractions, and the Design Activity Framework

Introduction & Recap

The professor starts by greeting the class and notes that today’s lecture is structured into two parts:
1. Briefly finish the few leftover slides from the “Interaction” lecture (previous week).
2. Dive into the design process for data visualization.
As usual, there is a short quiz at the beginning of the lecture to “wake everyone up.”

Public Service Announcement: Danish National Championships in AI

The annual Danish National Championships in AI are taking place this fall.
There is a prize of around 40,000 kroner.
Students can participate individually or in teams.
The professor suggests leveraging visualization plus machine learning or deep learning skills from current courses.

Quiz Review

Context: The quiz focuses on concepts from the previous lecture related to interaction techniques.

Questions and Answers:

Question: How do we call the diagrams that show a zoom level on the y-axis and a simplified screen space on the x-axis?
- Possible Answers: Angular plots, Two and a half (2.5D) representations, Sunburst, or Space-scale diagrams.
- Correct Answer: Space-scale diagrams.
- Explanation: These diagrams show different zoom levels over space. Sunburst diagrams are related to tree data, not space-scale. Angular plots was a made-up term. 2.5D representation is not correct here.
Question: Interactive lenses are an example of which principle?
- Possible Answers: Overview and detail, Pan and zoom, Drag and drop, Focus and context.
- Correct Answer: Focus and context.
- Explanation: Interactive lenses embed a high-detail focus region within a contextual overview. Overview+detail separates the two views; focus+context integrates them into one.
Question: The combination of panning and zooming, giving both origin and destination in view, is called what?
- Possible Answers: View switching, Smooth navigation, Interactive lens, Overview and detail.
- Correct Answer: Smooth navigation.
- Explanation: Smooth navigation (Nui & FatBike concept) provides a fluid transition between two viewpoints, ensuring both start and end points remain visible during the navigation.
Question: Which uses temporal separation for overview and detail?
- Possible Answers: Interactive lenses, Zooming, Brushing and linking, Minimap.
- Correct Answer: Zooming.
- Explanation: Zooming requires time passing to move from an overview to a detailed view. Interactive lenses are focus+context, brushing and linking is for multiple coordinated views, and a minimap is spatial separation, not temporal.
Question (Technical Issues): Another question about lenses and parallel pipelines.
- The professor clarifies that interactive lenses are realized using a parallel visualization pipeline, not overview+detail, not brushing and linking, etc.

Summary of Quiz Concepts:

Space-scale diagrams represent zoom and space in a 2D plot.
Interactive lenses are a focus+context technique, not overview+detail.
Smooth navigation combines panning and zooming to keep origin and destination visible.
Zooming is a temporal approach to overview+detail.
Interactive lenses are realized using a parallel visualization pipeline approach for flexibility and stackability.

Tangible Interaction and Pseudo-Haptics

Tangible Interaction

Example: A multi-touch table with physical rings (3D printed) placed on the table surface.
The table detects each ring (via markers).
Each ring acts as a “lens”: placing a ring over a map switches the underlying layer or view (e.g., from satellite to street map), turning it like a dial adjusts blend parameters.
Multiple rings can be stacked to combine lens functions (smart lenses).
Tangible props can facilitate exploration in ways a mouse cannot, lowering barriers to entry for non-technical users.

Another Example: Paper Prop for 3D Interaction

Using a depth-sensing camera and a coded piece of paper rolled up into a cylinder.
Users manipulate 3D microscopy data (fibers in tissues) by moving and rotating a paper prop.
This tangible user interface helps domain experts intuitively explore complicated 3D data.
Although advanced, such interfaces show how tangible interaction can simplify complex navigation tasks.

Pseudo-Haptics (Lightweight Haptics)

Without special hardware, modify cursor speed or cursor size to “feel” data density.
Example: Slowing cursor movement over dense data regions (like a “bump”).
Changing cursor size for fine-grained navigation.
Similar to “snap to grid” in PowerPoint—an existing example of pseudo-haptics.
Useful for subtle guidance and improving data exploration without extra hardware.

Transition to Data Visualization Design Process

Brief mention of software engineering slides (postponed).
Main focus now: The design process for data visualization.

Understanding the Visualization Problem

Data-Information-Knowledge-Wisdom (DIKW) Hierarchy

Data: Raw facts, measurements, or observations.
Information: Derived from data (distributions, relations, patterns).
Knowledge: Understanding causal relations, building models, functional dependencies.
Wisdom: Using knowledge for predictions, decision-making.

In visualization, we mostly deal with turning data into information that supports gaining knowledge and possibly wisdom.

Data Context vs. Data Content

Data Context (Domain/Reference Space/Independent Variables): Frame of reference (e.g., time, space, patient ID) that relates observations.
Data Content (Attribute Space/Dependent Variables): Actual measured values (e.g., temperature, CO2 concentration).

Data often modeled via graph topologies:

Regular/Irregular grids
Networks, Hierarchies

For continuous fields from sparse points, use interpolation (e.g., Inverse Distance Weighting) to estimate intermediate values.

Task Abstractions

A Task is what the user wants to do with a visualization.
Task lies between:
- User’s intent (“explore data”) and
- Actual interactions (“click”, “drag”).
Mid-level tasks: like Shneiderman’s mantra (“Overview first, zoom & filter, details on demand”) or simpler tasks like identify, locate, compare.

Types of Tasks

Elementary tasks: Identifying a value, locating items by value, comparing two points.
Synoptic tasks: Observing patterns, trends, clusters over sets of data, not just single points.
Andre & Andrew’s Framework:
- Lookup & Inverse Lookup
- Comparison & Inverse Comparison
- Relation Seeking (given relations, find data points)
- Synoptic equivalents: Pattern definition, pattern search, etc.

Why tasks matter:
They help ensure expressiveness (matching data types to visual encodings) and effectiveness (visualization supports intended user tasks).

Design Processes for Visualization

Design Activity Framework

Understand: Determine the visualization problem, user needs, data/task abstractions.
- Outcomes: Visualization requirements.
Ideate: Generate visualization ideas.
- Use sketching, low-fidelity mockups.
- Outcomes: A set of possible solution ideas.
Make: Create prototypes (interactive hi-fi prototypes).
- Outcome: Testable visualization prototypes.
Deploy: Implement final, robust visualization systems.

We focus on the Understand/Ideate/Make steps. Deploy is often a separate engineering challenge.

The Five Design Sheet Method (for Ideation)

Brainstorm Sheet:
- Generate ~15-20 rough idea sketches.
- Filter out non-viable ones.
- Categorize similar ideas.
- Combine and refine to pick 3 main ideas.
3 Separate Idea Sheets (2nd-4th sheets):
- For each chosen idea:
  - Show big picture layout.
  - Highlight key operations, interactions.
  - Show look & feel of crucial elements.
  - Discuss pros & cons.
Realization Sheet (5th sheet):
- Chosen final design.
- More detailed plan: algorithms, data handling, milestones, feasibility.

Benefits of 5 Design Sheets:

Structured creativity for non-artists.
Documented design rationale (proof of work).
Facilitates client communication and possible billing for partial work.

Guiding Design Principles (Andy Kirk)

Functional Design: Function before form. Must work correctly first.
Deliberate Design: Every element must have a justified purpose.
Intuitive Design: Should not be more complex than the data demands.
Ethical Design: Do not mislead the viewer.

Evaluation (Munzner’s Nested Model)

Layers: Domain situation → Data/task abstraction → Visual encoding → Algorithms.
Threats and Validation: Each layer has potential pitfalls; validate accordingly.
- Algorithms: Test complexity, runtime.
- Visual encoding: Test for overplotting, clarity.
- Tasks: User studies, feedback loops.

Benchmarking

Standard datasets, pixel-based metrics (e.g., overplot percentage).
Graph metrics (edge crossing counts).
User Studies: Measure task completion time, error rates, user experience.
Field Studies: Demonstrate how new visualizations lead to novel discoveries.

Visualization Critiques (Examples Shown by the Professor)

Example 1: Parallel Sets Visualization

Data: People displaced by disasters and conflicts.
Chart type: Sankey-like parallel sets.
Good aspects: Color choice works in grayscale, corrected shape to reduce perceptual illusions.
Critique:
- The chosen chart type (parallel sets) is meant for showing cross-correlations, but this chart does not show meaningful cross-correlations.
- The data categories and “other” grouping are inconsistent.
- Ethical choice of showing absolute numbers vs. normalization is context-dependent; here absolute counts emphasize humanitarian impact.

Suggested Improvement: Use simpler stacked bar charts for clarity if no cross-correlations are needed.

Example 2: Complex Connected Scatterplot for COVID Data

Data: COVID deaths over time, multiple countries.
Chart complexity confused readers; considered too complicated.
Issues:
- Negative values shown in parentheses (hard to interpret).
- Non-colorblind-safe color encoding.
- Overly smoothed data can be misleading.
- Connected scatterplots are among the most complex chart types; must provide reading guides.
Recommended approach: Simpler line charts, small multiples, or clear normalization.

Conclusion

We finalized discussion on interaction techniques and introduced tangible and pseudo-haptic interactions.
Main focus: Understanding visualization problems via data and task abstractions.
Learned about the design activity framework and how to ideate systematically using the Five Design Sheet method.
Discussed guiding principles for good visualization design and evaluation strategies.
Practiced critique on complex real-world visualizations, emphasizing the importance of clarity, appropriateness of chart types, and ethical considerations.

Next Lecture: Data Pre-processing.
Preparation: Read Chapter on “Preparing Data Tables” from “Making Sense of Data, Part 1.”

Reminders for the Next Session (Wednesday):

Bring your data and task abstraction for your project.
No need to bring the dataset itself, but know your data’s attributes, types (quantitative, categorical, etc.), and tasks.
You will sketch visualization ideas using the Five Design Sheet method.

Data Visualization, Redo

This post is licensed under CC BY 4.0 by the author.