Loading...

Messages

Proposals

Stuck in your homework and missing deadline? Get urgent help in $10/Page with 24 hours deadline

Get Urgent Writing Help In Your Essays, Assignments, Homeworks, Dissertation, Thesis Or Coursework & Achieve A+ Grades.

Privacy Guaranteed - 100% Plagiarism Free Writing - Free Turnitin Report - Professional And Experienced Writers - 24/7 Online Support

Data visualisation a handbook for data driven design andy kirk

25/11/2021 Client: muhammad11 Deadline: 2 Day

Data Visualisation

2

3

Data Visualisation

A Handbook for Data Driven Design

2nd Edition

Andy Kirk

Los Angeles London

New Delhi Singapore

Washington DC Melbourne

4

SAGE Publications Ltd

1 Oliver’s Yard

55 City Road

London EC1Y 1SP

SAGE Publications Inc.

2455 Teller Road

Thousand Oaks, California 91320

SAGE Publications India Pvt Ltd

B 1/I 1 Mohan Cooperative Industrial Area

Mathura Road

New Delhi 110 044

SAGE Publications Asia-Pacific Pte Ltd

3 Church Street

#10-04 Samsung Hub

Singapore 049483

© Andy Kirk 2019

First edition published 2016. Reprinted four times in 2016, twice in 2017, three times in 2018, and three times in 2019.

Apart from any fair dealing for the purposes of research or private study, or criticism or review, as permitted under the Copyright, Designs and Patents Act, 1988, this publication may be reproduced, stored or transmitted in any form, or by any means, only with the prior permission in writing of the publishers, or in the case of reprographic reproduction, in accordance with the terms of licences issued by the Copyright Licensing Agency. Enquiries concerning reproduction outside those terms should be sent to the publishers.

Library of Congress Control Number: 2018964578

British Library Cataloguing in Publication data

A catalogue record for this book is available from the British Library

ISBN 978-1-5264-6893-2

ISBN 978-1-5264-6892-5 (pbk)

Editor: Aly Owen

Editorial assistant: Lauren Jacobs

Production editor: Ian Antcliff

5

Copyeditor: Neville Hankins

Proofreader: Christine Bitten

Indexer: David Rudeforth

Marketing manager: Susheel Gokarakonda

Cover design: Shaun Mercier

Typeset by: C&M Digitals (P) Ltd, Chennai, India

Printed in the UK

At SAGE we take sustainability seriously. Most of our products are printed in the UK using responsibly sourced papers and boards. When we print overseas we ensure sustainable papers are used as measured by the PREPS grading system. We undertake an annual audit to monitor our sustainability.

6

Contents

Acknowledgements About the Author Discover Your Textbook’s Online Resources Introduction PART A FOUNDATIONS

1 Defining Data Visualisation 2 The Visualisation Design Process

PART B THE HIDDEN THINKING 3 Formulating Your Brief 4 Working With Data 5 Establishing Your Editorial Thinking

PART C DEVELOPING YOUR DESIGN SOLUTION 6 Data Representation 7 Interactivity 8 Annotation 9 Colour 10 Composition

Epilogue References Index

7

Acknowledgements

I could not have written this book without the unwavering support of my wonderful wife, Ellie, and my family. The book is dedicated to my inspirational Dad who sadly passed away before its publication. I want to acknowledge the contributions of the thousands of data visualisation practitioners who have created such a wealth of exceptional design work and smart writing. I have been devouring this for over a decade now and I am constantly inspired by the talents and minds behind it all. I also want to express my gratitude to the people and organisations who have granted me permission to reference and showcase their visualisation work in this book. Sincere thanks to the many people at Sage who have played a role in making this book grow from the first proposal and now to a second edition. Finally, to you the readers, I am hugely thankful that you chose to invest in this book. I hope it helps you in your journey to learning about this super subject.

8

About the Author

Andy Kirk is a freelance data visualisation specialist based in Yorkshire, UK. He is a visualisation design consultant, training provider, teacher, author, speaker, researcher and editor of the award- winning website visualisingdata.com.

After graduating from Lancaster University in 1999 with a BSc (hons) in Operational Research, Andy’s working life began with a variety of business analysis and information management roles at organisations including CIS Insurance, West Yorkshire Police and the University of Leeds. He discovered data visualisation in early 2007, when it was lurking somewhat on the fringes of the Web. Fortunately, the timing of this discovery coincided with his shaping of his Master’s (MA) degree research proposal, a self-directed research programme that gave him the opportunity to unlock and secure his passion for the subject. He launched visualisingdata.com to continue the process of discovery and to chart the course of the increasing popularity of the subject. Over time, this award-winning site has grown to become a popular reference for followers of the field, offering contemporary discourse, design techniques and vast collections of visualisation examples and resources. Andy became a freelance professional in 2011. Since then he has been fortunate to work with a diverse range of clients across the world, including organisations such as Google, CERN, Electronic Arts, the EU Council, Hershey and McKinsey. At the time of publication, he will have delivered over 270 public and private training events in 25 different countries, reaching more than 6000 delegates. Alongside his busy training schedule, Andy also provides design consultancy, his primary client being the Arsenal FC Performance Team, since 2015. In addition to his commercial activities, he maintains regular engagements in academia. Between 2014 and 2015 he was an external consultant on a research project called ‘Seeing Data’, funded by the Arts & Humanities Research Council and hosted by the University of Sheffield. This study explored the issues of data visualisation literacy among the general public and, inter alia, helped to shape an understanding of the human factors that affect visualisation literacy and the effectiveness of design. Andy joined the highly respected Maryland Institute College of Art (MICA) as a visiting lecturer in 2013 teaching a module on the Information Visualisation Master’s Programme through to 2017. From January 2016, he taught a data visualisation module as part of the MSc in Business Analytics at the Imperial College Business School in London through to 2018. As of May 2019, Andy has started teaching at University College London (UCL).

9

Discover Your Textbook’s Online Resources

Want more support around understanding and creating data visualisations? Andy Kirk is here to help, offline and on!

Hosted by the author and with resources organized by chapter, the supporting website for this book has everything you need to explore, practice, and hone your data visualisation skills.

Explore the field: expand your knowledge and reinforce your learning about working with data through libraries of further reading, references, and tutorials. Try this yourself: revise, reflect, and refine your skill and understanding about the challenges of working with data through practical exercises. See data visualisation in action: get to grips with the nuances and intricacies of working with data in the real world by navigating instalments of the narrative case study and seeing an additional extended example of data visualisation in practice. Follow along with Andy’s video diary of the process and get direct insight into his thought processes, challenges, mistakes, and decisions along the way. Chartmaker directory: access crowd-sourced guidance that aims to answer the crucial question ‘which tools make which charts?’ with this growing directory of examples and technical solutions for chart building.

Ready to learn more? Go beyond the book and dive deeper into data visualisation via the rest of Andy’s website (www.visualisingdata.com), which contains data visualisation tools and software, links to additional influential further reading, and a blog with monthly collections of the best data visualisation examples and resources each month.

10

http://www.visualisingdata.com
Introduction

The primary challenge one faces when writing a book about data visualisation is to determine what to leave in and what to leave out. Data visualisation is a big subject. There is no single book to rule it all because there is no one book that can truly cover it all. Each and every one of the topics covered by the chapters in this book could (and, in several cases, do) exist as books in their own right.

The secondary challenge when writing a book about data visualisation is to decide how to weave the content together. Data visualisation is not rocket science; it is not an especially complicated discipline, though it can be when working on sophisticated topics and with advanced applications. It is, however, a complex subject. There are lots of things to think about, many things to do and, of course, things that will need making. Creative and journalistic sensibilities need to blend harmoniously with analytical and scientific judgement. In one moment, you might be checking the statistical rigour of an intricate calculation, in the next deciding which shade of orange most strikingly contrasts with a vibrant blue. The complexity of data visualisation manifests in how the myriad small ingredients interact, influence and intersect to form a whole.

The decisions I have made when formulating this book’s content have been shaped by my own process of learning. I have been researching, writing about and practising data visualisation for over a decade. I believe you only truly learn about your own knowledge of a subject when you have to explain it and teach it to others. To this extent I have been fortunate to have had extensive experience designing and delivering commercial training as well as academic teaching.

I believe this book offers an effective and proven pedagogy that successfully translates the complexities of this subject in a form that is fundamentally useful. I feel well placed to bridge the gap between the everyday practitioners, who might identify themselves as beginners, and the superstar talents expanding the potential of data visualisation. I am not going to claim to belong to the latter cohort, but I have certainly been a novice, taking tentative early steps into this world. Most of my working hours are spent helping others start their journey. I know what I would have valued when I started out in this field and this helps inform how I now pass this on to others in the same position I was several years ago.

There is a large and growing library of fantastic books offering different theoretical and practical viewpoints on this subject. My aim is to add value to this existing collection by approaching the subject through the perspective of process. I believe the path to mastering data visualisation is achieved by making better decisions: namely, effective choices, efficiently made. I will help you understand what decisions need to be made and give you the confidence to make the right choices. Before moving on to discuss the book’s intended audience, here are its key aims:

To challenge your existing approaches to creating and consuming visualisations. I will challenge your beliefs about what you consider to be effective or ineffective visualisation. I will encourage you to eliminate arbitrary choices from your thinking, rely less on taste and instinct, and become more reasoned in your judgements. To enlighten you I will increase your awareness of the possible approaches to visualising

11

data. This book will broaden your visual vocabulary, giving you a wider and more sophisticated understanding of the contemporary techniques used to express your data visually. To equip is to provide you with robust tactics for managing your way through the myriad options that exist in data visualisation. To help you overcome the burden of choice, an adaptable framework is offered to help you think for yourself, rather than relying on inflexible rules and narrow instruction. To inspire is to open the door to a subject that will stimulate you to elevate your ambition and broaden your confidence. Developing competency in data visualisation will take time and will need more than just reading this book. It will require a commitment to embrace the obstacles that each new data visualisation opportunity poses through practice. It will require persistence to learn, apply, reflect and improve.

12

Who Is This Book Aimed At?

Anyone who has reason to use quantitative and qualitative methods in their professional or academic duties will need to grasp the demands of data visualisation. Whether this is a large part of your duties or just a small part, this book will support your needs.

The primary intended audiences are undergraduates, postgraduates and early-career researchers. Although aimed at those in the social sciences, the content will be relevant to readers from across the spectrum of arts and humanities right through to the natural sciences.

This book is intended to offer an accessible route for novices to start their data visualisation learning journey and, for those already familiar with the basics, the content will hopefully contribute to refining their capabilities. It is not aimed at experienced or established visualisation practitioners, though there may be some new perspectives to enrich their thinking: some content will reinforce existing knowledge, other content might challenge their convictions.

The people who are active in this field come from all backgrounds. Outside academia, data visualisation has reached the mainstream consciousness in professional and commercial contexts. An increasing number of professionals and organisations, across all industry types and sizes, are embracing the importance of getting more value from their data and doing more with it, for both internal and external benefit. You might be a market researcher, a librarian or a data analyst looking to enhance your data capabilities. Perhaps you are a skilled graphic designer or web developer looking to take your portfolio of work into a more data-driven direction. Maybe you are in a managerial position and though not directly involved in the creation of visualisation work, you might wish to improve the sophistication of the language you coordinate or commission others who are. Everyone needs the lens and vocabulary to evaluate work effectively.

Data visualisation is a genuinely multidisciplinary discipline. Nobody arrives fully formed with all constituent capabilities. The pre-existing knowledge, skills or experiences which, I think, reflect the traits needed to get the most out of this book would include:

Strong numeracy is necessary as well as a familiarity with basic statistics. While it is reasonable to assume limited prior knowledge of data visualisation, there should be a strong desire to want to learn it. The demands of learning a craft like this take time and effort; the capabilities will need nurturing through ongoing learning and practice. They are not going to be achieved overnight or acquired alone from reading this book. Any book that claims to be able magically to inject mastery through just reading it cover to cover is over- promising and likely to under-deliver. The best data visualisers possess inherent curiosity. You should be the type of person who is naturally disposed to question the world around them. Your instinct for discovering and sharing answers will be at the heart of this activity. There are no expectations of your having any prior familiarity with design principles, but an appetite to embrace some of the creative aspects presented in this book will heighten the impact of your work. Time to unleash that suppressed imagination! If you are somebody fortunate to possess already a strong creative flair, this book will guide you through when and crucially when not to tap into this sensibility. You should be willing

13

to increase the rigour of your analytical decision making and be prepared to have your creative thinking informed more fundamentally by data rather than just instinct. No particular technical skills are required to get value from this book, as I will explain shortly. But you will ideally have some basic knowledge of spreadsheets and experience of working with data irrespective of which particular tool.

This is a portable practice involving techniques that are subject-matter agnostic. Throughout this book you will see a broad array of examples from different industries covering many different topics. Do not be deterred by any example being about a subject different to your own area of interest. Look beyond the subject and you will see analytical and design choices that are just as applicable to you and your work: a line chart showing political forecasts involves the same thought process as would a line chart showing stock prices changing or average global temperatures rising. A line chart is a line chart, regardless of the subject matter.

The type of data you are working with is the only legitimate restriction to the design methods you might employ, not your subject and certainly not traditions in your subject. ‘Waterfall charts are only for people in finance’, ‘maps are only for cartographers’, ‘Sankey diagrams are only for engineers’. Enter this subject with an open mind, forget what you believe or have been told is the normal approach, and your capabilities will be expanded.

Data visualisation is an entirely global community, not the preserve of any geographic region. Although the English language dominates written discourse, the interest in the subject and work created from studios through to graphics teams originates everywhere. There are cultural influences and different flavours in design sensibility around the world which enrich the field but, otherwise, it is a practice common and accessible to all.

14

Finding the Balance

15

Handbook vs Manual

The description of this book as a ‘handbook’ positions it as distinct from a tutorial-based manual. It aims to offer conceptual and practical guidance, rather than technical instruction. Think of it more as a guidebook for a tourist visiting a city than an instruction manual for how to fix a washing machine.

Apart from a small proportion of visualisation work that is created manually, the reliance on technology to create visualisation work is an inseparable necessity. For many beginners in visualisation there is an understandable appetite for step-by-step tutorials that help them immediately to implement their newly acquired techniques.

However, writing about data visualisation through the lens of selected tools is hard, given the diversity of technical options that exist in the context of such varied skills, access and needs. The visualisation technology space is characterised by flux. New tools are constantly emerging to supplement the many that already exist. Some are proprietary, others are open source; some are easier to learn but do not offer much functionality; others do offer rich potential but require a great deal of foundation understanding before you even accomplish your first bar chart. Some tools evolve to keep up with current techniques; they are well supported by vendors and have thriving user communities, others less so. Some will exist as long-term options whereas others depreciate. Many have briefly burnt brightly but quickly become obsolete or have been swallowed up by others higher up the food chain. Tools come and go but the craft remains.

There is a role for all book types and a need for more than one to acquire true competency in a subject. Different people want different sources of insight at different stages in their development. If you are seeking a text that provides instructive tutorials, you will learn from this how to accomplish technical developments in a given technology. However, if you only read tutorial-based books, you will likely fall short in the fundamental critical thinking that will be needed to harness data visualisation as a skill.

I believe a practical, rather than technical, text focusing on the underlying craft of data visualisation through a tool-agnostic approach offers the most effective guide to help people learn this subject.

The content of this book will be relevant to readers regardless of their technical knowledge and experience. The focus will be to take your critical thinking towards a detailed, fully reasoned design specification – a declaration of intent of what you want to develop. Think of the distinction as similar to that between architecture (design specification) and engineering (design execution).

There is a section in Chapter 3 that describes the influence technology has on your work and the places it will shape your ambitions. Furthermore, among the digital resources offered online are further profiles of applications, tools and libraries in common use in the field today and a vast directory of resources offering instructive tutorials. These will help you to apply technically the critical capabilities you acquire throughout this book.

16

17

Useful vs Beautiful

Another important distinction to make is that this book is not intended to be seen as a beauty pageant. I love flicking through glossy ‘coffee table’ books as they offer great inspiration, but often lack substance beyond the evident beauty. This book serves a different purpose to that. I believe, for a beginner or relative beginner, the most valuable inspiration comes more from understanding the thinking behind some of the amazing works encountered today, learning about the decisions that led to their conceptual development.

My desire is to make this the most useful text available, a reference that will spend more time on your desk than on your bookshelf. To be useful is to be used. I want the pages to be dog-eared. I want to see scribbles and annotated notes made across its pages and key passages underlined. I want to see sticky labels peering out above identified pages of note. I want to see creases where pages have been folded back or a double-page spread that has been weighed down to keep it open. It will be an elegantly presented and packaged book, but it should not be something that invites you to look but not touch.

18

Pragmatic vs Theoretical

The content of this book has been formed through years of absorbing knowledge from as many books as my shelves can hold, generations of academic work, endless web articles, hundreds of conference talks, personal interactions with the great and the good of the field, and lots and lots of practice. More accurately, lots and lots of mistakes. What I present here is a pragmatic distillation of what I have learned and feel others will benefit from learning too.

It is not a deeply academic or theoretical book. Experienced or especially curious practitioners may have a desire for deeper theoretical discourse, but that is beyond the intent of this particular text. You have to draw a line somewhere to determine the depth you can reasonably explore about a given topic. Take the science of visual perception, for example, arguably the subject’s foundation. There is no value in replicating or attempting to better what has already been covered by other books in greater quality than I could achieve.

An important reason for giving greater weight to pragmatism is because of the inherent imperfections of this subject. Although there is so much important empirical thinking in this subject, the practical application can sometimes fail to translate beyond the somewhat artificial context of a research study. Real-world circumstances and the strong influence of human factors can easily distort the significance of otherwise robust concepts.

Critical thinking will be the watchword, equipping you with the independence of thought to decide rationally for yourself which solutions best fit your context, your data, your message and your audience. To accomplish this, you will need to develop an appreciation of all the options available to you (the different things you could do) and a reliable approach for critically determining what choices you should make (the things you will do and why).

19

Contemporary vs Historical

I have huge respect for the ancestors of this field, the dominant names who, despite primitive means, pioneered new concepts in the visual display of statistics to shape the foundations of the field being practised today. The field’s lineage is decorated by pioneers such as William Playfair, W. E. B. Du Bois, Florence Nightingale and John Snow, to name but a few. To many beginners in the field, the historical context of this subject is of huge interest. However, this kind of content has already been covered by plenty of other book and article authors.

I do not want to bloat this book with the unnecessary reprising of topics that have been covered at length elsewhere. I am not going to spend time attempting to enlighten you about how we live in the age of ‘Big Data’ and how occupations related to data are or will be the ‘sexiest jobs’ of our time. The former is no longer news, the latter claim emerged from a single source. There is more valuable and useful content I want you to focus your time on.

The subject matter, the ideas and the practices presented here will hopefully not date a great deal. Of course, many of the graphic examples included in the book will be surpassed by newer work demonstrating similar concepts as the field continues to develop. However, their worth as exhibits of a particular perspective covered in the text should prove timeless. As time passes there will be new techniques, new concepts and new, empirically evidenced rules. There will be new thought-leaders, new sources of reference and new visualisers to draw insight from. Things that prove a manual burden now may become seamlessly automated in the near future. That is the nature of a fast-growing field.

20

Analysis vs Communication

A further distinction to make concerns the subtle but critical difference between visualisation used for analysing data and visualisation used for communicating data.

Before a visualiser can confidently decide what to communicate to others, he or she needs to have developed an intimate understanding of the qualities and potential of the data. In certain contexts, this might only be achieved through exploratory data analysis. Here, the visualiser and the viewer are the same person. Through visual exploration, interrogations of the data can be conducted to learn about its qualities and to unearth confirmatory or enlightening discoveries about what insights exist.

Visualisation for analysis is part of the journey towards creating visualisation for communication, but the techniques used for visual analysis do not have to be visually polished or necessarily appealing. They are only serving the purpose of helping you truly to learn about your data. When a data visualisation is being created to communicate to others, many careful considerations come into play about the requirements and interests of the intended audience. This influences many design decisions that do not exist alone with visual analysis.

For the scope of this book the content is weighted more towards methods and concerns about communicating data visually to others. If your role is concerned more with techniques for exploratory analysis rather than visual communication, you will likely require a deeper treatment of the topic than this book can reasonably offer.

Another matter to touch on here concerns the coverage of statistics, or lack thereof. For many people, statistics can be a difficult topic to grasp. Even for those who are relatively numerate and comfortable working with simple statistical methods, it is quite easy to become rusty without frequent practice. The fear of making errors with intricate statistical calculations depresses confidence and a vicious circle begins.

You cannot avoid the need to use some statistical techniques if you are going to work with data. I will describe some of the most relevant statistical techniques in Chapter 4, at the point in your thinking where they are most applicable. However, I do believe the range and level of statistical techniques most people will need to employ on most of their visualisation tasks can be overstated. I know there will be exceptions, and a significant minority will be exposed to requiring advanced statistical thinking in their work.

It all depends, of course. In my experience, however, the majority of data visualisation challenges will generally involve relatively straightforward univariate and bivariate statistical techniques to describe data. Univariate techniques help you to understand the shape, size and range of a single variable of data, such as determining the minimum, maximum and average height of a group of people. Bivariate techniques are used to observe possible relationships between two different variables. For example, you might look at the relationship between gross domestic product and medal success for countries competing at the Olympics. You may also encounter visualisation challenges that require a basic understanding of probabilities to assist with forecasting risk or modelling uncertainty.

21

The more advanced applications of statistics will be required when working with larger complicated datasets, where multivariate techniques are employed simultaneously to model the significance of relationships between multiple variables. Above and beyond that, you are moving towards advanced statistical modelling and algorithm design.

Though it may seem unsatisfactory to offer little coverage of this topic, there is no value in reinventing the wheel. There are hundreds of existing books better placed to offer the depth you might need. That statistics is such a prolific and vast field in itself further demonstrates how deeply multidisciplinary a field visualisation truly is.

22

Chapter Contents

The book is organised into three main parts (A, B and C) comprising ten chapters and an Epilogue. Each chapter opens with a preview of the content to be covered and closes with a summary of the most salient learning points to emerge. There are collections of further resources available online to substantiate the learning from each chapter.

For most readers, especially beginners, it is recommended that you start from the beginning and proceed through each chapter as presented. For those setting out to begin working on their own visualisation, you might jump straight into Chapters 2–5 to ensure you are fully prepared for some of the important preparatory activities you need to accomplish before moving on to look at developing your design solution. For those with more experience and/or prior exposure to this subject, who are perhaps looking to fine-tune specific aspects of their design skills, most of your interest will lie in Part C, comprising Chapters 6–10. For readers who just want to dip in and out of specific topic areas, although each chapter builds sequentially from the preceding ones, they can all be read in isolation. Follow any sequence that satisfies your needs. The coloured tabs on the outer edge will provide quick visual navigation through the distinct parts and chapters within.

23

Part A: Foundations

Part A introduces some important foundational understanding about data visualisation as a subject area and as an activity. The contents of the first two chapters give shape to the coverage across the rest of the book.

Chapter 1 ‘Defining Data Visualisation’ will be the logical starting point for those who are new to the field, providing a definition for the subject and exploring some of the tensions that enrich this subject. The second section explains some of the distinctions and overlaps with other related disciplines. If you already know what data visualisation is about, you might choose to pass on this; it does, though, help frame many of the discussions elsewhere.

Chapter 2 ‘The Visualisation Design Process’ introduces the value of following a design process, the sequence of activities around which the book’s contents in Parts B and C are organised. It explains what is involved and offers some useful tips to help you seamlessly adopt this approach. Where the process offers organisation and efficiency, design principles ensure effectiveness. The second section will describe what separates the good from the bad in visualisation design, building up your convictions to help with your upcoming decision making.

24

Part B: The Hidden Thinking

Part B profiles the first three stages of the data visualisation design process. These are the hidden preparatory stages that will significantly influence the path you take towards an eventual solution.

Chapter 3 ‘Formulating Your Brief’ covers the opening tasks involved in initiating, defining and planning the requirements of your work. The first section looks at issues around context, specifically about the importance of defining curiosity and identifying the circumstances that will shape your project. The second section considers the vision of your work, looking at what purpose it intends to serve and how you might creatively define the type of work you will need to pursue. Finally, a short section looks at the value of harnessing initial ideas.

Chapter 4 ‘Working With Data’ commences your practical involvement with your data, stepping through the four distinct steps that acquaint you with the potential of your critical raw material. Data acquisition outlines the different origins of and methods for obtaining your data. Data examination profiles the different characteristics that define the type, extent and condition of your data. Data transformation builds on your examination work to find ways of modifying and enhancing your data to prepare it for use. Finally, data exploration discusses methods for discovering more about the qualities and insights hidden away in your data.

Chapter 5 ‘Establishing Your Editorial Thinking’ reflects on the possibilities offered by your data and explains the importance of committing to an editorial path. The chapter opens with a definition about the influence of editorial thinking, using two case studies to explain how editorial definitions influence design choices later in the process.

25

Part C: Developing Your Design Solution

Part C represents the main part of this book and covers the five distinct layers of the data visualisation anatomy. They are presented in separate chapters to help organise your thinking and to avoid being overwhelmed by the detailed options that exist. However, they are ultimately interrelated matters and the chapter sequencing across this part is carefully arranged to support this. Each chapter follows a similar structure, opening with an array of different possible design options and supplemented by guidance on the factors that will influence your choices. Initially, you will need to make decisions about what elements to include around data representation (charts), interactivity and annotation. You will then complete your thinking about the appearance of these elements, through colour and composition.

Chapter 6 ‘Data Representation’ introduces the act of visual encoding and then expands on this to provide a detailed profile of 49 distinct chart types to help broaden your visual vocabulary. The chapter closes with a run through the key factors that will influence the suitability of your data representation choices.

Chapter 7 ‘Interactivity’ introduces the potential value of incorporating interactive features in your work, profiling a wide range of options – such as filtering, highlighting and animating – that will enable users to interrogate and control a visualisation. The chapter closes with the main considerations that will influence your selection of interactive features.

Chapter 8 ‘Annotation’ describes the importance of providing useful assistance to your viewers, including headings, chart apparatus, and labels. The chapter closes with a look at which factors will inform the choices you make.

Chapter 9 ‘Colour’ commences with an overview of different colour models. This provides the basis for understanding the different ways of applying colour to facilitate data legibility and deliver functional decoration. Once again, having introduced the options, we will look at how you arrive at appropriate choices.

Chapter 10 ‘Composition’ explores the final element of developing your design solution concerning how you organise the placement and sizing of all your visual elements within the space you have to work. Looking at matters of layout, arrangement and chart sizing, we will then wrap up this topic with a discussion about how to make your decisions.

Epilogue: To close the book, the epilogue will summarise the development cycle of activities you will need to undertake as you move your detailed design specification to a fully executed solution.

26

Digital Resources

The opportunity to supplement the print version of this book with further digital companion resources helps to offer readers a range of additional learning materials:

a written and video-based case-study of a visualisation project that demonstrates the design process in action; an extensive and up-to-date catalogue of over 350 data visualisation tools; a large collection of tutorials and resources to help develop your technical capabilities in making a wide range of different charts; useful exercises designed to help embed the learning covered in each chapter; a digital gallery of all the artwork included in this book and many further examples of the concepts presented across all chapters; refreshed reading resources to support ongoing learning about the subjects covered in each chapter.

27

Glossary

Consistency in the meaning of language and terms used in data visualisation is important. Though data visualisation is no different to many fields that get bogged down by superfluous semantic noise, it can only help to establish clarity about its usage in this book at least.

28

Roles

Visualiser: This is the role I am assigning to you – the person making the visualisation. Sometimes people prefer to use terms like researcher, analyst, developer, storyteller or even ‘visualist’. Designer would also be particularly appropriate, but I want to broaden the scope of the role beyond just design to cover all activities involved in this discipline.

Viewer: This is the role assigned to the recipient, who is viewing or using your visualisation product. It offers a broader and better fit than alternatives such as consumer, reader, user or customer. However, ‘user’ will be temporarily adopted during the more active chapter about interactivity.

Audience: This concerns the collective group of viewers for whom your work is intended. Within an audience there will be cohorts of different viewer types that you might characterise through distinct personas to help your thinking about serving their varied needs.

Consuming: This will be the general act of the viewer, to consume. I will use more active descriptions like ‘reading’ and ‘using’ when consuming becomes too passive or vague, and when distinctions are needed between reading a chart and using interactive features.

29

Data

Raw data: For the purpose of this book, raw data will be the initial state of data you have collected, received or downloaded that has not yet been subjected to any statistical or transforming treatment. Some people take issue with the implied ‘rawness’ this label implies, given that data will have already lost its raw state having been recorded by some instrument, stored, retrieved and maybe cleaned already. I appreciate this viewpoint but think it is the most pragmatic label relevant to most people’s understanding.

Data source: This is the term used to describe the origin(s) of the raw data used in a visualisation.

Dataset: A table of data is an array of values visually arranged into rows and columns, usually existing in a spreadsheet or database. The rows are the records – instances or items – and the columns are the variables – details about the items. Datasets are visualised in order to ‘see’ the size, patterns and relationships that are otherwise hard to observe. A dataset may comprise one or a collection of several tables.

Tabulation: For the purpose of this book, I distinguish between types of datasets that are ‘normalised’ and others that are ‘cross-tabulated’. This distinction will be explained in context in Chapter 4.

Data types: The variables (columns) in a table that hold details about items (records) will have different scales of measurement or data types. At the most general level, distinctions in quantitative (e.g. salary) and categorical (e.g. gender) data are important in how you will statistically and visually handle them. A detailed distinction between data types, with examples, will again be offered in Chapter 4.

Series: A series of values is essentially a sequence of related values in a table. An example of a series would be the highest recorded temperatures in a city for each day over a month. Though individual daily values will be stored as distinct moment-in-time measurements, the activity of temperature never stops ‘happening’ and therefore the collected values have a legitimate continuous relationship through the series.

30

Visualisation

Project: For the purpose of this book, we will consider the development of a data visualisation as being a project. Even though you might consider something a quick, small task, it will still need to involve the thinking consistent with the stages of the process covered in this book.

Chart type: Charts are visual representations of data. There are many ways of represent- ing your data, using different combinations of marks, attributes, layouts and apparatus. Their combinations form archetypes of charts more commonly named chart types, such as the bar chart, dendrogram or treemap.

Graphs, plots, diagrams and maps: Traditionally the term graph has been used to describe visualisations that display network relationships, while chart would be commonly used to label common devices like the bar or pie chart. Plots and diagrams are more specifically attached to special types of displays but with no pattern of consistency in their usage. All these terms are so interchangeable that any energy expended in explaining meaningful difference is redundant. For the purpose of this book, I will generally stick to the term chart to act as the main label to cover all representation types. In places, this ‘umbrella’ term will incorporate thematic maps, for the sake of convenience, even though they clearly have a visual structure that is quite different to standard charts.

Graphic: The term graphic will be used when referring to visuals more focused on information- led displays such as explanation or process diagrams as distinct from charts that are concerned with data-driven visuals. It might also be used to refer more broadly to a visualisation that incorporates charts, text and images.

Format: This concerns the difference in output form between printed work, digital work and physical visualisation work.

Functionality: This concerns the difference in whether a visualisation is static or interactive. Interactive visualisations allow you to manipulate and interrogate a computer-based display of data. They are published on the Web, exist within apps, or are on larger digital displays, as in galleries. In contrast, a static visualisation displays a non-changeable, still display of data that could be published in print but also digitally. Just because something is published digitally does not automatically make it interactive.

Axes: Many common chart types have axis lines that provide a reference for measuring quantitative values or positioning categorical values. The horizontal axis is known as the x-axis and the vertical axis is known as the y-axis.

Scales: Scales exist in two forms, typically. Firstly, as a set of marks along an axis that indicate positions for the range of values included in a chart. Scales are normally presented in regular intervals (10, 20, 30, etc.) representing units of measurement, such as prices, distances, years or percentages. A scale may also be presented in a key to explain associations between, for example, different sizes of areas or classifications of different colour attributes.

31

Legend: Charts that employ visual attributes, such as colours, shapes or sizes to represent values of data, will often be accompanied by a legend to house visual explanations of classifications, known as keys.

Outliers: Outliers are points of data that are outside the normal range of values. They are the unusually large or small or simply different values that stand out and generally draw a viewer’s attention.

Correlation: This is a measure of the presence and extent of a mutual relationship between two or more variables of data. For example, you would expect to see a correlation between the height and weight of people or age and salary of workers. Devices like scatter plots, in particular, help visually to portray possible correlations between two quantitative values.

32

Part A Foundations

33

1 Defining Data Visualisation

This opening chapter will introduce data visualisation through the prism of a proposed definition. Each component that forms this definition will be explored in depth to illustrate some of the main characteristics and complexities of this subject.

The second part of the chapter will position data visualisation in the context of other related disciplines or fields, explaining where overlaps or clear distinctions exist. Overall, this chapter will seek to forge a shared understanding that will help set the tone and reasoning for the structure of this book.

34

1.1 What Is Data Visualisation?

It is useful to commence this book with a definition of data visualisation (Figure 1.1). It helps to ensure we (you the reader, me the writer) have a mutual understanding, from the outset, about what is meant by data visualisation in the context of this text. The components of this definition carve the subject into distinct perspectives around which the contents of this book are organised.

Figure 1.1 A Definition for Data Visualisation

Let me delve into this and describe the roles of and relationships between each component expressed. I will also explain where and how these topics will be covered. Firstly, let’s look at data.

Data is names and amounts. It is groupings, descriptions and measurements. It is dates and locations. It will be helpful for discussions in this book to think of data as being typically structured in table form, with rows of records and columns of variables. Most data we commonly encounter will exist in textual, numeric or a combined form, but it is also worth noting the opportunities that increasingly exist for working with data assets in media forms of images, audio and video.

In Chapter 4 you will learn about the importance of developing an intimate understanding of your data to acquaint yourself fully with its properties, its condition and its qualities.

You will see that data is the fundamental element driving the decisions across this design process. Without data there is no material to feed nor necessitate a visualisation. Conversely, without visualisation the value of data can be unfulfilled. This is not to say we should always visualise data, absolutely not, but in most circumstances, to harness the maximum value of data, there are missed opportunities if we do not.

To explain, here is a simple illustration. When data is presented in a table, it is a straightforward task for a viewer to scan the rows and columns to seek out values of relevance or to discover particular data points that trigger interest. For instance, by viewing the table in Figure 1.2 it should prove quite simple to find out what the percentage share of online sales for a Company X was during April 2016. Now look for the percentage share of store sales during December 2011.

35

Figure 1.2 Proportion of Sales % by Channel Over Time

As a viewer your task is simply to find the relevant row and column intersection: look at the value display and read it. The percentage share of online sales for Company X during April 2016 is 84, and for store sales during December 2011 it is 71.

To find which sales channel had the second largest percentage share of sales during August 2014, again just find the relevant row, compare the three quantitative values along that row, and then determine which channel column contains the second-ranked amount. For this month, the online channel, at 44, had the second largest percentage share of sales.

The limitations of reading data when it is presented in this form emerge when we want to answer broader questions: that is, enquiries that transcend the scope of an answer originating from a single or small number of adjacent data points. From the same table, how easy do you find it to identify the headline trends across each sales channel over the period of time displayed?

You can probably ascertain that the percentage share of sales for stores starts quite high then drops to nothing, the percentage share of online sales starts quite low and then reaches the 100% maximum, and the percentage share of sales via telephone is consistently tiny.

Though it takes a while to study the values under each sales channel column in order to form this summary observation, it is still possible. But what if your observations need to be formed more quickly? What if you needed to know more about the localised patterns of ups and downs within those global trends? What if you wanted to identify the first occasion when the percentage share of online sales exceeded the percentage share of store sales? When was the last occasion the percentage share of store sales exceeded that of online sales? During which periods did the different sales channels experience the most accelerated upward or downward changes?

36

These are harder questions to answer efficiently and accurately from the data alone. This is because synthesising observations from multiple values across different rows and columns to perceive broader relationships fails to exploit fully the capabilities of our visual system – how our eyes and mind work together to make sense of objects and patterns. To read values in isolation, store them in our short-term memory and compare them in our head with other isolated values is mentally challenging. It is not impossible, since we can still accomplish this with just a table of data, but it will take an excessive amount of time and effort.

This workload will also only increase as the data grows in volume and complexity. For instance, what if this table were 1000 rows deep and there were 20, 50 or 100 different columns to work through? Or, what if the quantities had similar value sizes and more modest variation? How easy would it then be to notice significant patterns?

The crux of all this is that we can look at data, but we cannot really see it. To see data, we need to represent it in a different, visual form.

Returning to the definition, the term visual representation is arguably the quintessential activity of data visualisation. Representation involves making decisions about how you are going to portray your data visually so that the subject understanding it offers can be made accessible to your audience. In simple terms, this is all about charts and the act of selecting the right chart to show the features of your data that you think are most relevant.

Homework is Completed By:

Writer Writer Name Amount Client Comments & Rating
Instant Homework Helper

ONLINE

Instant Homework Helper

$36

She helped me in last minute in a very reasonable price. She is a lifesaver, I got A+ grade in my homework, I will surely hire her again for my next assignments, Thumbs Up!

Order & Get This Solution Within 3 Hours in $25/Page

Custom Original Solution And Get A+ Grades

  • 100% Plagiarism Free
  • Proper APA/MLA/Harvard Referencing
  • Delivery in 3 Hours After Placing Order
  • Free Turnitin Report
  • Unlimited Revisions
  • Privacy Guaranteed

Order & Get This Solution Within 6 Hours in $20/Page

Custom Original Solution And Get A+ Grades

  • 100% Plagiarism Free
  • Proper APA/MLA/Harvard Referencing
  • Delivery in 6 Hours After Placing Order
  • Free Turnitin Report
  • Unlimited Revisions
  • Privacy Guaranteed

Order & Get This Solution Within 12 Hours in $15/Page

Custom Original Solution And Get A+ Grades

  • 100% Plagiarism Free
  • Proper APA/MLA/Harvard Referencing
  • Delivery in 12 Hours After Placing Order
  • Free Turnitin Report
  • Unlimited Revisions
  • Privacy Guaranteed

6 writers have sent their proposals to do this homework:

Homework Master
Write My Coursework
Top Quality Assignments
Academic Mentor
Smart Homework Helper
Assignment Guru
Writer Writer Name Offer Chat
Homework Master

ONLINE

Homework Master

I am an experienced researcher here with master education. After reading your posting, I feel, you need an expert research writer to complete your project.Thank You

$50 Chat With Writer
Write My Coursework

ONLINE

Write My Coursework

I am an elite class writer with more than 6 years of experience as an academic writer. I will provide you the 100 percent original and plagiarism-free content.

$39 Chat With Writer
Top Quality Assignments

ONLINE

Top Quality Assignments

I have read your project details and I can provide you QUALITY WORK within your given timeline and budget.

$19 Chat With Writer
Academic Mentor

ONLINE

Academic Mentor

I am an elite class writer with more than 6 years of experience as an academic writer. I will provide you the 100 percent original and plagiarism-free content.

$33 Chat With Writer
Smart Homework Helper

ONLINE

Smart Homework Helper

Being a Ph.D. in the Business field, I have been doing academic writing for the past 7 years and have a good command over writing research papers, essay, dissertations and all kinds of academic writing and proofreading.

$35 Chat With Writer
Assignment Guru

ONLINE

Assignment Guru

After reading your project details, I feel myself as the best option for you to fulfill this project with 100 percent perfection.

$41 Chat With Writer

Let our expert academic writers to help you in achieving a+ grades in your homework, assignment, quiz or exam.

Similar Homework Questions

Berton braley the will to win - Kadri gopalnath saxophone nagumomu - Org and beha - Humanities through the arts 9th edition pdf free - Ethical dilemma powerpoint - Principles of Marketing (2 Modules completed) - Tropicana marketing strategy - +91-8890675453 love marriage problem solution IN Aizawl - South melbourne little athletics - An aptitude for a job implies a - The thinker's guide to the art of socratic questioning pdf - NUR601- Module 4 Assignment - Wireless Technology 3 - Theoretical yield of product - History of public health and community health nursing - Historical lenses and history's value - What is the factored form of 27a 6 8g 12 - DUALISM VS PHYSICALISM - Abcs of z os system programming volume 9 - Ocr ict for a2 - Vout vin op amp - Gre chemistry practice test - MS 2 - Essay rhetorical analysis 3 pages - Facial Recognition and Mobile Apps - Nmb ball bearing catalogue - Art 11 Discussion - What is external stability in economics - Write a Press Release for Me! - 2 papers and a powerpoint - SAP Discussion board - Global business cultural analysis - The oxford solid state basics - Maglev wind turbine project - The far away brothers pdf - Spi 200 futures index - BS DS 1. - Sodium borohydride reduction of 9 fluorenone - Electronic configuration of cu+ - Team project on U-Haul - Short essay about a Bollywood movie - Palo alto url filtering license expired - Mcdougal littell algebra 2 online textbook activation code - Tales from dimwood forest - What type of diversification strategy is amazon pursuing - The concept of subculture refers to - Aldridge & district dog training club - Fact finding techniques in database - Global supply chain management simulation v2 chegg - Zinc and vinegar chemical reaction - Professional nursing practice concepts and perspectives 7th edition apa citation - What are the benefits and challenges associated with business 2.0 - CFI3-2 - Midnight in the garden of good and evil cliff notes - Eastman kodak future - Joe ragland gorilla glue - Written task 1 examples ib - I need help - How many perpendicular lines does a triangle have - South lanarkshire building warrant - Critique Research Assignment. Research of nursing - Alko iq7 wiring diagram - Lab: Viruses - Week 8 Crisis intervention - Wild west produces two types of cowboy hats - This week’s journal articles focus on empowering leadership and effective collaboration in geographically dispersed teams - Queensland kindergarten funding scheme - Finance Assignment - 3400 grams to lbs and oz - Religion assignment 2 - 64 foot organ pipe frequency - Ground state vs excited state electron configuration worksheet - Apple tv target market - Do tarsiers have a dental comb - Challenges in the Business Environment - Into thin air comprehension questions - Reynolds adaptable intelligence test iq - Two metal balls are the same size - Discussion / Answer 2 questions and comment on 2 students / 200~300 words for answers / need in 24 hours / required outside sources - Week8 MA215 - Owl pellet identification chart - The table gives the values of a function obtained from an experiment. use them to estimate - Linon sarah five drawer padded chest cabinet - EDU-534 Lesson Plan - The three principles of economics include optimization, equilibrium, and empiricism. - Structure of oedipus rex - Explain what the barriers are to interpersonal communications - Deliverable 06 R&W - Fire hydrant paint specification - Donkey kong country 2 102 percent - Metro water complaint number - Two pellets each with a charge of - Nurse practitioner core competencies 2016 - Chocolate diamond rings at jc penneys - One Page Response to book "Brave New World" - Divine right of kings macbeth - Nix it company's ledger on july - Plant cell rap lyrics - Advocacy and opposition rybacki & rybacki 7th edition pdf - General Biology 4