Data Abstraction – Lecture 03

Analysis framework: Four levels, three questions

Domain sitation

Who are the target user?
Abstraction
Translate from specifics of domain to vocabulary of visualization
- What is show? Data abstraction
- Why is the user looking at it? Task abstraction
  Idiom
How is it shown?
- Visual encoding idiom: how to draw
- Interaction idiom: how to manipulate
  Algorithm
Efficient computation

Nest model

Downstream: cascading effects
Upstream: iterative refinement

What does data mean?

Semantics: real-world meaning of the data

Column names help with semantics
May also include rules about data
Useful for asking good question about the data
Data types: structural or mathematical interpretation of data
Item: individual entity, discrete
- a.k.a independent variable
Attribute: some specific property that is measured, observed, or logged
- a.k.a dependent variable
Links
- Express relationship between two items
Positions
- Spatial data
Grids
- Sampling strategy for continuous data
  Both often require metadata
we can infer some of this information

Dataset types – see lecture 03…..

Tables

Flat table
- one item per row
- each column is an attribute
- each cell holds value for item-attribute pair
- unique key (could be implicit)
Multidimensional table
- indexing based on multiple keys
  Network / Graph
nodes (vertices) connected by links (edges)
- node: synonym for item in the context of networks
- link: a relation between two nodes
tree is a special case: no cycles
- often have roots and are directed
  Fields
Grids: specify how data is sampled
- Positions: a location in space

Atrribute Types

Categorical
Ordered
- Ordinal
- Quantitative
  - time

Ordering Direction

Sequential
Diverging
Cyclic

Dataset Availability

Static
Dynamic

Data abstraction: Three operations

Data abstraction: Translate from domain-specific language to generic visualisation language

Identify dataset types, attribute types
Identify cardinality
- how many items in the dataset?
- what is the cardinality of each attribute?
  - number of levels of categorical data
  - range for quantitative data
Consider whether to transform data
- guided by understanding of task

Task Abstraction – Lecture 04

From domain to abstraction

Domain characterisation: details of application domain
- group of users, target domain, their questions & data
- domain questions / problems
  - break down into simpler abstract tasks
Abstraction: data & task
map what and why into generalised terms
- identify tasks that users wish to perform, or already do
- find data types that will support those tasks
  - possibly transform / derive if need be

Task abstraction: Actions and targets

Very high-level pattern

Actions

analyse
- high-level choices
- visualisation for consumption
  - discover new knowledge
    - generate new hypothesis or verify existing one
    - designer doesn’t know what users need to see
  - present known information
    - presenter already knows what the data says
    - wants to communicate this to an audience
  - enjoy
    - similar to discover, but without concrete goals
- visualisation for production
  - annotate
  - record
    - persist visualisations for historical record
  - derive
    - create new data
    - create derived attributes
    - crucial design choice
search
- find a known/unknown item
- what does the user know?
  - lookup: target known, laocation known
  - locate: target known, location unknown
  - browse: target unknown, location known
  - explore: target unknown, location unknown
query
- find out about characteristics of item
- How much of the data matter?
  - one: identify
    - characteristics or references
  - some: compare (often 2)
    - similarities or differences
  - all: summarise
    - overview of everthing
      Targets
what is being acted on
All data
- trends
- outliers
- features
Attributes
- One
  - Distribution
    - extremes
- Many
  - Dependency
  - Correlation
  - Similarity
Network Data
- Topology
  - Paths
Spatial Data
- Shape
  {action, target} pairs
discover distribution
compare trends
locate outliers
browse topology

Marks and Channels – Lecture 05

Visual encoding of information:

Marks: the basic graphical elements in a visualisation
Channels: way to control the appearance of the marks based on attributes
Marks as constraints
geometric primitives
classified by dimensionality
- 0D: points
- 1D: Lines
- 2D: Areas
- ……
marks for links
- containment
- connection
  Channels: expressiveness vs effectiveness
control appearance of marks
type and amount of information that can be conveyed to human perceptual system
classified
- position
  - horizontal
  - vertical
  - both
- color
- shape
- tilt
- size
  - length
  - area
  - volume
    expressiveness
match channel type to data type
expressiveness principle: all data from the dataset and nothing more should be show
- do encode ordered data in an ordered fashion
- don’t encode categorial data in a way that implies an ordering
  effectiveness
some channels are better than others
effectiveness priciple: the most important attributes should be the most salient
- saliency: how notceable something is
- how do the channels we have discussed measure up?
accuracy
- how precisely can we tell the difference between encoded items
discriminability
- how many unique steps can we perceive?
separability
- is our ability to use this channel affected by another one?
poppit
- can things jump out using this channel?
  ranking
magnitude channels: ordered attributes
- position on common scale
- position on unaligned scale
- length
- tilt / angle
- area
- depth
- color luminance
- color saturation
- curvature
- volume
identity channels: categorical attributes
- spatial region
- color hue
- motion
- shape
Accuracy / discriminability / separability / pop-out of visual channels
Relative vs absolute judgements of visual perception system

Data Visualisation -- Theories

Data Abstraction – Lecture 03

Analysis framework: Four levels, three questions

Domain sitation

Abstraction

Idiom

Algorithm

Nest model

What does data mean?

Semantics: real-world meaning of the data

Data types: structural or mathematical interpretation of data

Both often require metadata

Dataset types – see lecture 03…..

Tables

Network / Graph

Fields

Atrribute Types

Ordering Direction

Dataset Availability

Data abstraction: Three operations

Task Abstraction – Lecture 04

From domain to abstraction

Task abstraction: Actions and targets

Very high-level pattern

Actions

Targets

{action, target} pairs

Marks and Channels – Lecture 05

Visual encoding of information:

Marks as constraints

Channels: expressiveness vs effectiveness

expressiveness

effectiveness

ranking

Further Reading

Java Cheat Sheet for LeetCode

Blind 75

ES6 modules on Python http.server