Google Sheets can be used to calculate descriptive statistics for small data sets. For large data sets, however, we recommend software built for that scale, such as R, Tableau, SAS, Stata, or SPSS.
In the video below, we show how to calculate these seven descriptive statistics.
Coefficient of Variation
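As a quick illustration of the last statistic, the coefficient of variation is the standard deviation divided by the mean, usually expressed as a percentage. In Google Sheets this could be written as =STDEV(range)/AVERAGE(range). The Python sketch below shows the same arithmetic; the data values are made up for illustration, and it assumes the sample (not population) standard deviation is wanted.

```python
import statistics


def coefficient_of_variation(values):
    """Return the sample coefficient of variation as a percentage."""
    mean = statistics.mean(values)
    if mean == 0:
        raise ValueError("coefficient of variation is undefined when the mean is 0")
    # statistics.stdev uses the sample (n - 1) formula.
    return statistics.stdev(values) / mean * 100


# Hypothetical small data set, for illustration only.
sample = [2, 4, 4, 4, 5, 5, 7, 9]
cv = coefficient_of_variation(sample)
print(f"CV = {cv:.1f}%")
```

A higher percentage means the values are more spread out relative to their average.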
In conclusion, Google Sheets can be used to quickly and conveniently calculate descriptive statistics like the seven discussed in this article when working with small data sets.
Adding filters in Tableau helps you make your Tableau dashboards more useful. Filters allow you to display subsections of your data set in your dashboard.
One way to add a filter is to use one worksheet that you have in the dashboard as a filter for the other worksheets. To do this, select the filter icon in the menu that appears to the right of the worksheet when you have that worksheet selected.
Another way is to open the drop-down menu to the right of the worksheet and hover over “Filters”. This brings up a second menu where you can select which of the worksheet's variables you want to filter by.
The video below shows how to do both of these methods for adding filters in less than 3 minutes.
As shown in the video above, adding filters to your Tableau dashboards makes them more useful and makes it easier to display a selected subsection of your data set.
When using Google Sheets, you may wonder if it’s possible to use a SQL-style query language to pull up data, especially when working with large data sets. The answer is yes, it is possible! The QUERY function accepts a query written in the Google Visualization API Query Language, which closely resembles SQL.
The video below gives examples of using the following formula with three different Google Sheets sample data sets created by RILLIAN.
=QUERY(data, query, headers)
Three Minute Tuesday is a video series by RILLIAN, a consulting agency. Each video briefly covers, in approximately 3 minutes or less, a topic related to work involving Research & Insights Leading to Learning, Innovation, And actioN (R.I.L.L.I.A.N.).
One sample data set is an e-commerce store sample data, the second is an epidemiology infectious waterborne disease and exposure sample data set, and the third is a sample videographers log data set.
One type of data that this is useful with is e-commerce data. E-commerce data, like the sample data set shown in this video, has tons of information, all of which is useful in the operations of the store, but looking at all of it can be a little overwhelming. Using the query formula above, the specific data that you want to look at can be pulled into a separate sheet in the Google Sheets workbook. As you can see in the video, querying the data makes it easier to see in which states a certain product, Baby Yoda t-shirts, is selling best.
Another example is querying data from an epidemiological sample data set. This data set has 2,000 rows with two columns of data. The first column has data on if the person has a confirmed diagnosis of a waterborne disease or not. The second column has data on exposure; if the person had the exposure of swimming in a body of water of interest in this study or if they did not have the exposure. The number one is used to indicate yes and the number zero is used to indicate no.
Using a query, only those who had both the diagnosis and the exposure can be pulled into a separate sheet in the Google Sheets workbook.
Or, if you are interested in those who had a confirmed diagnosis but didn’t have the exposure, just change the one to a zero in the exposure condition, and the query pulls up those cases instead.
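To make the filtering logic concrete: in Google Sheets, a formula along the lines of =QUERY(A1:B2001, "select A, B where A = 1 and B = 1", 1) would express this, assuming the diagnosis is in column A and the exposure in column B (the range and column letters are assumptions, not taken from the video). The Python sketch below runs the equivalent SQL against a tiny in-memory table using the standard sqlite3 module. The table name, column names, and rows are hypothetical.

```python
import sqlite3

# Hypothetical rows: (diagnosis, exposure), where 1 = yes and 0 = no.
rows = [(1, 1), (1, 0), (0, 1), (0, 0), (1, 1)]

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE cases (diagnosis INTEGER, exposure INTEGER)")
conn.executemany("INSERT INTO cases VALUES (?, ?)", rows)


def select_cases(diagnosis, exposure):
    """Return the rows matching the given diagnosis and exposure flags."""
    cursor = conn.execute(
        "SELECT diagnosis, exposure FROM cases "
        "WHERE diagnosis = ? AND exposure = ?",
        (diagnosis, exposure),
    )
    return cursor.fetchall()


# Diagnosed and exposed.
diagnosed_and_exposed = select_cases(1, 1)
# Diagnosed but not exposed: only the exposure flag changes from 1 to 0.
diagnosed_not_exposed = select_cases(1, 0)

print(len(diagnosed_and_exposed), len(diagnosed_not_exposed))
```

Switching between the two groups only requires changing one value in the condition, which is the same adjustment described above for the Sheets formula.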
A third sample data set is a videographer’s video log. This has all kinds of data that is useful to the videographer. When you only want to look at a couple of things in the data, such as the take number of a video clip (the take number counts how many times a scene was filmed to get it just right), it can be useful to run a data query and pull that data into a separate sheet in the Google Sheets workbook, as shown in the video.
As you can see, there are many ways this type of data query language can be useful in a variety of different types of data sets in Google Sheets.
Have you used SQL in Google Sheets? What kind of data set did you analyze? Did you find SQL to be helpful for the data set you were working with?
Population data and the percent change in population are two demographic measures that are great to look at for a variety of different reasons.
There are many different ways to visualize these two measures, depending on what you want to learn from the data or show to others in a presentation. In this post and in the video we demonstrate two different ways to do so using Tableau.
One way is to display the population data on a map. This is especially helpful if geography is important to what you are working on. For example, if you are presenting the population sizes of different cities to stakeholders who are not familiar with where those cities are or how far apart they are, a map can show them this at a glance.
The percent change in population can also be visualized on a map. However, a better way to show percent change in population could be a bar graph.
The second way to display population data is with a bar graph, such as the bar graph shown in the video below for population change in 5 East Coast cities: Atlanta, Boston, New York City, Richmond (VA), and Washington D.C.
As you can see in the video, the interactive bar graph created using Tableau makes it easier to see which city had the greatest population change over the two five year periods looked at in this data set.
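For readers who want to check the numbers behind such a chart, percent change is the difference between the new and old values divided by the old value, times 100. The Python sketch below shows the calculation; the city names and population figures are placeholders, not the data used in the video.

```python
def percent_change(old, new):
    """Return the percent change from old to new."""
    if old == 0:
        raise ValueError("percent change is undefined when the starting value is 0")
    return (new - old) / old * 100


# Placeholder figures (start, end), for illustration only.
populations = {
    "City A": (100_000, 110_000),
    "City B": (250_000, 245_000),
}

changes = {
    city: percent_change(start, end)
    for city, (start, end) in populations.items()
}

# City with the largest change in either direction.
largest = max(changes, key=lambda city: abs(changes[city]))

for city, change in changes.items():
    print(f"{city}: {change:+.1f}%")
```

Computing the percent change before building the chart lets cities of very different sizes be compared on the same axis.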
Three Minute Tuesday videos by RILLIAN, a consulting agency, briefly cover a topic in approximately 3 minutes or less. These videos provide brief overviews of topics that are useful for anyone working in the areas of Research & Insights Leading to Learning, Innovation, And actioN (R.I.L.L.I.A.N.).
Are there any helpful tips you would like to add about visualizing population data or population change data in Tableau?
Do you have any suggestions for topics you would like to see us cover in our Three Minute Tuesday videos?
Email Jillian at Jillian.Regan@RILLIANconsulting.com .
Six Sigma is a quality improvement methodology that organizations in many different fields utilize to improve their organization’s processes and performance.
A systematic way for keeping track of Six Sigma projects is essential to being able to effectively use Six Sigma to improve the processes and performance of the organization. It is also important that this system is standardized throughout the organization and that all employees who will be using it are trained on how to correctly use it.
A good system will include a way to decide whether the identified problem is significant enough to be addressed in a Six Sigma project before one is started. In addition, there should be a way to track the impact of both the problem and the solution on the customer (voice of the customer) as well as on the business (voice of the business). It is also important to document attributes of the data to be collected, such as whether it is on an ordinal or nominal scale, to make it easier to keep track of which types of statistical analysis are appropriate for the data.
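One lightweight way to document data attributes is a small data dictionary that records each variable's measurement scale. The Python sketch below is only an illustration: the variable names are hypothetical, and the mapping from scale to suitable summaries follows the conventional nominal/ordinal/interval/ratio guidance rather than anything specific to a particular Six Sigma tool.

```python
from dataclasses import dataclass

# Conventional guidance on which summaries suit each measurement scale
# (an assumption for illustration; adapt it to your organization's standards).
SUITABLE_SUMMARIES = {
    "nominal": ["frequency counts", "mode"],
    "ordinal": ["frequency counts", "mode", "median", "percentiles"],
    "interval": ["median", "mean", "standard deviation"],
    "ratio": ["median", "mean", "standard deviation", "coefficient of variation"],
}


@dataclass(frozen=True)
class Variable:
    """One entry in a project's data dictionary."""

    name: str
    scale: str
    description: str

    def __post_init__(self):
        if self.scale not in SUITABLE_SUMMARIES:
            raise ValueError(f"unknown measurement scale: {self.scale!r}")

    def suitable_summaries(self):
        return SUITABLE_SUMMARIES[self.scale]


# Hypothetical variables for a hypothetical project.
data_dictionary = [
    Variable("defect_type", "nominal", "Category of defect observed"),
    Variable("satisfaction", "ordinal", "Customer rating from 1 (low) to 5 (high)"),
    Variable("cycle_time_minutes", "ratio", "Minutes from order to delivery"),
]

for variable in data_dictionary:
    print(variable.name, "->", ", ".join(variable.suitable_summaries()))
```

Recording the scale once, where the whole team can see it, helps avoid mistakes such as averaging a nominal category code.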
One way to have a good system for Six Sigma projects could be to use a software program specifically designed for this, such as SixSigma Guide. SixSigma Guide is a software guide designed specifically for Six Sigma Projects.
The homepage for SoftLogic, with SixSigma guide on the far right and two other software programs on the left and middle.
About SixSigma Guide:
It was created by Dr. Reiner Hutwelker
He is a business consultant,
Master Black Belt in Six Sigma,
and an adjunct professor at Hochschule für angewandte Wissenschaften München and Management Center Innsbruck in Eresing, Germany.
Note: this software tool does not provide the statistical analysis capabilities needed in most Six Sigma projects, so a statistical software program like R, SAS, Minitab, or Stata should be used alongside it to analyze the data.
Review of SixSigma Guide:
This software is a useful and practical tool, although it may not be the right fit for every organization. It was clearly developed by a Master Black Belt in Six Sigma who drew on his many years of practical experience in conducting Six Sigma projects and in educating future Six Sigma quality improvement professionals. The software is relatively easy to learn for people who are familiar with other software like Microsoft Excel. One of the best things about it is that it walks the user, step by step, through the whole Six Sigma DMAIC (Define, Measure, Analyze, Improve, Control) cycle. Instead of having all these steps and sub-steps saved in different spreadsheets in folders on a shared drive, everything can be kept track of in one place.
While it is functional and relatively user friendly, I feel that in order to be more marketable the design of the user interface might need to be enhanced. While it is perfectly functional, people have become used to seeing beautifully designed software programs. Large organizations may also wish to be able to customize it with their logos and branding when implementing it across the organization.
A screenshot of the download page to download the SixSigma Guide software. The option to download the free trial is listed first. Both English and German slides with examples of using the software for a Six Sigma project are linked below the download link.
Another way could be to combine software the organization is already using, such as Microsoft Excel or Google Docs, with a file sharing system, such as a server or a cloud-based storage solution like Dropbox, so that all stakeholders who need access to the projects can easily get to them.
Whichever systems and software you choose to use, it is essential that there is a systematic and standardized way of conducting and keeping track of Six Sigma projects across your organization.
I’d love to hear your thoughts! Have you used SixSigma Guide software or a similar software for Six Sigma projects? What do you think about it?
Disclaimer: This is an independent review of software. I am not sponsored in any way by any of the software companies or individuals listed or reviewed in this article. None of the links are affiliate links. I am just sharing my thoughts about software that could be useful for Six Sigma projects. I learned of this software while taking an online course on Six Sigma, from the Technical University of Munich (TUM) in Germany and edX.org, in which Dr. Hutwelker was a guest lecturer.
Jillian Regan, MPH is a consultant at Rillian. She enjoys quality improvement using data to improve processes within the organization, so that the organization can better serve its clients, customers, patients, or others. Connect with her by email at Jillian.Regan@RillianConsulting.com or Twitter (@JillianReganMPH) or LinkedIn.
Follow Rillian on Facebook to get updates on articles like this one or updates about current projects, like a bicycling survey!
In the information age, organizations and departments within organizations often encounter the problem of data existing in silos, where one department or organization cannot easily access data that they need for a project from another department or organization. This can lead to inefficiencies, such as having to stop a project and use time and resources just to retrieve the data needed before the project could continue.
Keep in Mind the Goals for the Data Project
It is important to keep in mind the organization’s problem or goal which the data project needs to measure or address.(1) Focus on that first instead of seeing what data may already be available. While it could be more time and resource intensive, the goal of the project may require new data to be collected because there is not existing data that meets the needs of the project.
Support from the Top and From All Involved
The executive team should be supporting and leading the effort as well as involving the managers of each department.(1) Also, staff involved in all levels of data collection, data entry, and data analysis should be aware of and involved to some degree in the organization’s effort to reduce data silos. Other staff who may not be directly involved in inputting the data should also be made aware of and engaged to some degree in the process of reducing data silos.
Make a New Framework for Data Collection
After breaking down silos, or when collecting new data, make a plan or framework for how the data can be integrated for all the stakeholders so that new data silos are not created.(2) For example, instead of storing separate data files on separate computers, have a shared drive that can be accessed by those who need that data across departments or organizations. (3)
Training and Relationship Building
Enhancing knowledge of others’ data needs and increasing trust between people from different departments or organizations is key to breaking down data silos and improving data sharing.
Workshops and learning lunch events are one way to do this. These bring people together to learn about what other departments, organizations, or projects need, and they create an atmosphere where people can connect with one another and improve their working relationships. (3)
Documentation is Key
Keep track of data sources, i.e. where the data came from. This is vital to understanding the data. This will also make it easier to know where to go to get updated data as needed. It is also essential to do when having teams of people working on the data project, so that everyone will be on the same page.
Have a system for keeping track of whom to reach out to for specific data, along with their contact information, since they may be in a different department, on a different team, or at another organization.
Breaking Down Data Silos Across Different Research Topic Areas
A recent study involved a collaboration across multiple sectors (researchers, a children’s hospital, and a police department) that used two sources of data, spatial video and geographic information systems (GIS), to provide insights into two research topic areas, active school transport (AST) and child injury research, in which the data is usually siloed in separate research projects.(4)
State Government Initiative for Open Data Portal
The state government of Virginia (in the United States) has an initiative to create an open data portal, Data VA, of non-sensitive, public information that is made freely available for public use in an easily readable format.(5) At Datapalooza 2017, Virginia Secretary of Health and Human Resources Bill Hazel spoke about the need for the different sectors of state government to work horizontally across sectors in order to best serve people, because no one person whom they serve fits into only one sector. (6) He also spoke about the need for ethical use of data for a public purpose.
Breaking down data silos can allow organizations or departments within organizations to access data that will provide valuable insights into their organizations. This article presented a few ways in which to do this.
I’d love to hear from you about your ideas! Has your organization or department had success in breaking down data silos? What are some of the ways in which they were able to do this?
Schuch L, Curtis JW, Curtis A, Hudson C, Wuensch H, Sampsell M, Wiles E, Infantino M, Davis AJ. Breaking Out of Surveillance Silos: Integrative Geospatial Data Collection for Child Injury Risk and Active School Transport. J Urban Health. 2016 Feb;93(1):36-52. doi: 10.1007/s11524-015-0006-9. Retrieved from https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4794455/