24 Apr 2020
3 MINS READ
The first visual representation and analysis is Path Analysis using Sankey Diagram. WHO China Joint Mission released a report based on its study on novel coronavirus disease (COVID-19).
One of the key information in the report is the visualization of the pattern of the disease progression based on the laboratory confirmed cases.
Let us try and understand the analysis (Figure 1),
This single visualization has the power to provide information related to the disease progression (stages), the outcome within stages, the no of people who have recovered/died within the stage and the trajectory through combination of multiple colors. The report and definition of the stages can be accessed here.
Figure 1: Pattern of disease progression for COVID-19 in China based on 55924 laboratory confirmed cases. Source: WHO China Joint Mission on COVID-19 Report
The second interesting visualization and analysis technique is Graph Analysis. Let us look at how Nebula Graph, created by an open source Graph Database company and coding academy at Singapore, built a network map of COVID-19 cases.
In case of Nebula network graph (Figure 2), it looked at data on how five people became infected with the novel coronavirus in just one city, Tianjin. The network was loaded with data of people who are either healthy or sick, based on the physical address that those people travelled to. The data eventually helped to traceback the known carrier of the novel coronavirus. Though the numbers are too small in this case, such visualization proves to be an excellent method to track contact.
The network graph created by the Singapore based company (Figure 3) focused on using the data to visualize the degree of interconnectedness between cases and infected clusters within Singapore. Each node represents an infected person and the edge represents the transmission of the contagion through a known contact.
One such similar contact tracing network graph for India can be found at covid19india website.
The third visualization technique is a Dendrogram (Hierarchical Clustering). NEXTSTRAIN is an open-source program for the real-time tracking of pathogen evolution such as COVID-19.The “Genomic epidemiology of novel coronavirus” such as Phylogeny, Transmission and Diversity are tracked, analyzed and visualized by NEXTSTRAIN (Data from “Global Initiative on Sharing all Influenza Data”).
So how do I interpret a dendrogram?
What we are looking at is a tree diagram/layout showing hierarchical clustering i.e. relationships between similar sets of data.
Technically a branch is called as “Clade”. Clades are arranged according to how similar (or dissimilar) they are. Clades that are close to the same height are similar to each other, clades with different heights are dissimilar – the greater the difference in height, the more the dissimilarity.
For an interactive visualization you can visit: https://nextstrain.org/ncov
1. Understanding the numbers in the given context – The key difference between 500 diseased individuals in a country with total population of 50+ million and same 500 infected people in a country with population of 1 billion can make a huge impact.
2. Understanding the numbers with relation to the period: The differences between current numbers and projections must be clearly highlighted to avoid spread of fear and panic.
3. Understanding the domain related boundaries: The difference of analyzing data from a mathematical/analytical standpoint and not from a domain expert’s perspective. Epidemiology is a field unto itself with serious consequences.
4. Understanding the ethical considerations: The difference between presenting visualizations through interpreting data and presenting an induvial opinion that can be biased must be clearly understood in order to avoid stigmatizing.
5. Understanding the demographics: The difference between generic data and demographic specific data must be understood to present the visualizations effectively.
6. Understanding the human nature: The thin line between obligation of presenting the data and facts and the effect it can have on the country or risk-group who are undergoing self- isolation/dread must be understood.
Data visualization has played a key role in understanding the spread and impact of COVID-19. Have you come across any cooler visualization that you know have visualized and tracked COVID-19?
Stay Safe and follow Social Distancing…
About the Author
Ranganathan Rajkumar is a Project Director in BI&A for Big Data at Hexaware Technologies. He has around 19 years of experience combining technologies from Speech Recognition, IVR/VRU, Big Data, Artificial Intelligence and Machine Learning. He has helped many organizations to adopt a data-driven culture by helping them built Big Data and Analytics CoE. He has helped organization to adopt AWS, Azure and GCP cloud by architecting end to end environment for Data Migration, Big Data Analytics and AI/ML model pipeline. Ranganathan is also a keen industry follower of advancement and research in the field of AI, Deep Learning, NLP and Computer Vision.
BI & Analytics
13 Nov 2020
07 Sep 2020
11 Jun 2020
28 May 2020
08 May 2020
13 Apr 2020
06 Apr 2020
31 Mar 2020
26 Mar 2020
23 Jun 2017
06 Aug 2015
13 Jul 2015
28 Oct 2014
17 Apr 2014
24 Mar 2014
22 Jan 2014
20 Dec 2013
01 Nov 2013
26 Sep 2013
03 Sep 2013
26 Aug 2013
29 Apr 2013
04 Mar 2013
21 Feb 2013
04 Feb 2013
03 Jan 2013
26 Nov 2010
19 Mar 2009
Digital Assurance
02 Jan 2012
17 Feb 2012
Infrastructure Mgmt. Services
02 Mar 2012
06 Feb 2013
Digital Assurance, Enterprise Solutions
14 Feb 2013
18 Feb 2013
27 Feb 2013
Others
01 Mar 2013
Enterprise Solutions
05 Mar 2013
18 Mar 2013
Digital Assurance, Enterprise Solutions, Others
22 Mar 2013
12 Apr 2013
26 Apr 2013
13 May 2013
11 Jun 2013
17 Jun 2013
25 Jun 2013
19 Aug 2013
27 Aug 2013
10 Sep 2013
19 Sep 2013
24 Sep 2013
30 Sep 2013
01 Oct 2013
03 Oct 2013
19 Nov 2013
Enterprise Solutions, Manufacturing and Consumer
28 Nov 2013
03 Dec 2013
03 Jan 2014
27 Jan 2014
31 Jan 2014
12 Feb 2014
13 Feb 2014
20 Mar 2014
11 Jun 2014
Manufacturing and Consumer
26 Jun 2014
30 Jun 2014
10 Jul 2014
15 Jul 2014
16 Jul 2014
18 Jul 2014
26 Aug 2015
28 Sep 2015
07 Oct 2015
26 Oct 2015
07 Mar 2016
22 Mar 2016
13 May 2016
23 May 2016
Application Transformation Mgmt.
11 Jul 2016
25 Aug 2016
03 Sep 2016
14 Sep 2016
15 Nov 2016
22 Nov 2016
25 Nov 2016
Business Process Services
25 Apr 2017
Banking and Financial Services
18 May 2017
30 May 2017
27 Jun 2017
18 Jul 2017
26 Oct 2017
Healthcare, Insurance
28 Nov 2017
11 Dec 2017
25 Jan 2018
21 Feb 2018
14 Mar 2018
( Mandatory field * )
The information you provide will be used in accordance with our terms ofPrivacy Policy
Please Check on "I Agree" to register for the blog.