International Journal of Coronaviruses

International Journal of Coronaviruses

International Journal of Coronaviruses

Current Issue Volume No: 1 Issue No: 4

Review Article Open Access Available online freely Peer Reviewed Citation

Models and data Analysis of the Outbreak Risk of COVID-19

1School of Information and Mathematics, Yangtze University, Jingzhou, Hubei, China.

2Hospital, Hubei University of Technology, Wuhan, Hubei, China.

3School of Science, Hubei University of Technology, Wuhan, Hubei, China.


With the spread of the new coronavirus around the world, governments of various countries have begun to use the mathematical modeling method to construct some virus transmission models assessing the risks of spatial spread of the new coronavirus COVID-19, while carrying out epidemic prevention work, and then calculate the inflection point for better prevention and control of epidemic transmission. This work analyzes the spread of the new coronavirus in China, Italy, Germany, Spain, and France, and explores the quantitative relationship between the growth rate of the number of new coronavirus infections and time. In investigating the dynamics of a disease such as COVID-19, its mathematical representation can be constructed at many levels of details, guided by the questions the model tries to help answer. Mathematical sophistication may have to yield to a more pragmatic approach closer to the ability to make predictions that inform public health policies.


In December 2019 , the first Chinese patients with pneumonia of unknown cause is China admitted to hospital in Wuhan, Hubei Jinyintan , since then, COVID-19 in the rapid expansion of China Wuhan, Hubei, in a few months time, COVID-19 is Soon it spread to a total of 34 provincial-level administrative regions in China and neighboring countries, and Hubei Province immediately became the hardest hit by the new coronavirus. In an emergency situation, we strive to establish an accurate infectious disease retardation growth model to predict the development and propagation of COVID-19, and on this basis, make some short-term effective predictions. The construction of this model has Relevant departments are helpful for the prevention and monitoring of the new coronavirus, and also strive for more time for the clinical trials of Chinese researchers and the research on vaccines against the virus to eliminate the new corona virus as soon as possible.


According to the original data change law, Establish a Logistic growth model, we collect and compare and integrate the spread of COVID-19 in China, Italy, France, Spain and Germany, record the virus transmission trend among people in each country and the protest measures of relevant government departments.


Based on the analysis results of the Logistic model model, the Logistic model has a good fitting effect on the actual cumulative number of confirmed cases, which can bring a better effect to the prediction of the epidemic situation and the prevention and control of the epidemic situation.


In the early stage of the epidemic, due to inadequate anti-epidemic measures in various countries, the epidemic situation in various countries spread rapidly. However, with the gradual understanding of COVI D -19, the epidemic situation began to be gradually controlled, thereby retarding growth

Author Contributions
Received 13 May 2020; Accepted 02 Sep 2020; Published 03 Jan 2021;

Academic Editor: Jose Luis Turabian, Health Center Santa Maria de Benquerencia Toledo, Spain.

Checked for plagiarism: Yes

Review by: Single-blind

Copyright ©  2021 Jinming Cao, et al.

Creative Commons License     This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Competing interests

The authors have declared that no competing interests exist.


Jinming Cao, Xia Jiang, Bin Zhao (2021) Models and data Analysis of the Outbreak Risk of COVID-19. International Journal of Coronaviruses - 1(4):26-32.

Download as RIS, BibTeX, Text (Include abstract )

DOI 10.14302/issn.2692-1537.ijcv-20-3383


After the outbreak of COVID-19 in China, COVID-19 has also erupted in other countries in the world. Among the countries where new pneumonia outbreaks, Spain, Italy, France and Germany are more serious1. As of April 27, Spain, Italy, France and Germany have each accumulated diagnosed 229842 cases, 199414 cases, 165,842 cases, 158758 cases, the new crown pneumonia spread and various measures of everyday life and people's social normal operation had not Estimated impact2.

In fact, there are some urgent problems to be solved regarding the spread of COVID -19 . Can existing interventions effectively control COVID-19? Can you elaborate on the changes and development characteristics of each epidemic situation? Can you combine the conclusions found in the comparison of the city / region, actual national population, medical level, traffic conditions, geographic location, customs and culture, and anti-epidemic measures ? What mathematical model can we build to solve the problem?

COVID-19 is a new coronavirus discovered in December 2019. The epidemic data is not sufficient, and clinical methods such as clinical trials are still in the exploration stage. So far, the epidemic situation data is difficult to apply directly to the existing mathematical model. The problems to be solved are: how effective the existing emergency response is and how to invest medical resources more scientifically in the future. On this basis, this article aims to study the shortcomings of this part 3, 4, 5.



We obtained epidemiological data from the Aminer website, the People's Republic of China from January 22 to April 3, and Spain, Italy, France, Germany from February 15 to April 27. This includes data such

as cumulative confirmed cases, cumulative deaths, newly diagnosed cases per day, cumulative number of cured cases, and existing confirmed cases. The relevant input is shown in the Figure 1, Figure 2, Figure 3, Figure 4, Figure 5.

Figure 1.Cumulatively confirmed cases
 Cumulatively confirmed cases

Figure 2.Cumulatively cured cases
 Cumulatively cured cases

Figure 3.Daily new cases
 Daily new cases

Figure 4.Cumulative deaths
 Cumulative deaths

Figure 5.Existing confirmed cases
 Existing confirmed cases

The Model

Based on the collected epidemic data, we tried to find the propagation law of COVID-19 and proposed effective prevention and control methods.

There are generally three methods for systematically studying the spread of infectious diseases. One is to establish a dynamic model of infectious diseases. The second is statistical modeling using statistical methods such as random processes and time series analysis. The third is to use data mining technology to obtain information in the data and discover the epidemic law of infectious diseases. Using the collected data from various countries, this article mainly uses the third method.

In this paper, the growth model of COVID-19 transmission is established , and the prediction effect of the mathematical model on the spread of COVID-19 epidemic is compared.

Based on Logistic Estimated Square Law

The traditional SEIR model can not describe the different developments of the epidemic well. After analyzing the actual situation and the existing data, we have established a more effective infectious disease transmission model. According to the actual situation of the epidemic, we will analyze the relevant data indicators of the five countries (cumulatively diagnosed cases, cumulative deaths, newly diagnosed cases per day, cumulative number of cured cases, existing confirmed cases) to adapt to the current situation of the new coronary pneumonia epidemic in the world propagation. Table 1.

Table 1. The significance of each parameter under the construction of Logistic model
 symbol meaning unit
a,b parameter none
x the time since the outbreak began to count day
 t time day
 y cumulative confirmed cases people
 x0 E^(-t) none
 y0 1/y none
 e constants  none

As can be seen from the data graph, the change in cumulative death toll in Italy over time is a non-linear process. Considering the shape of the scatter plot and the model generally involving the Logistic curve model, here we use the Logistic curve model for fitting. The basic form of the logistic curve model is:

y = 1 / (a + be ^ (-t))

Therefore, we need to transform this nonlinear process into a linear model after data processing. Take x0 = e ^ (-t), y0 = 1 / y; Then the original model is converted to a linear model y0 = a + bx0.


Since COVID-19 has been developing in Italy for a long period of time, and the cumulative number of confirmed cases is relatively large, the data is more convincing, so here we take the cumulative number of confirmed cases in Italy from February 15th to May 3rd The nonlinear model becomes a linear model, and matlab is used for fitting linear regression analysis. Matlab source code is as follows 6, 7, 8, 9:

x = 1: 1: 27;

y =


plot (x, y, 'r *'); xlabel ('time')

ylabel ('population') x0 = exp (-x);

y0 = 1. / y;

f = polyfit (x0, y0,1);

y_fit = 1 ./ (f (1). * exp (-0.338. * x) + f (2)); plot (x, y_fit * 1000);

hold on

plot (x, y, 'r *'); xlabel ('time') ylabel ('population')


Logistic Model Estimates

On the basis of the cumulative number of confirmed cases in Italy from February 15th to May 3rd, we used Matlab to establish a Logistic model and performed linear regression analysis. Using the above processing, we can get the predicted cumulative number of confirmed cases in Italy as shown in Figure 6.

Figure 6.Comparison of actual cumulative confirmed cases and simulated cumulative confirmed cases
 Comparison of actual cumulative confirmed cases and simulated cumulative               confirmed cases

As shown in Figure 6, we can conclude that the Logistic model has a good fitting effect on the actual cumulative number of confirmed cases, thus providing reference value for departments and hospitals at all levels to effectively intervene and prevent the spread of new coronavirus in the next few days.


The spread of COVID-19 is affected by many complex factors. In the early stage of the transmission of COVID-19, it is difficult to establish a Logistic model and parameter estimation and obtain a fairly accurate simulation result, but the initial estimated parameters such as the growth rate of the confirmed cases and the possible cumulative maximum confirmed cases can be obtained through existing data. It is helpful to solve important parameters such as infection rate and recovery rate, which will help us to grasp the transmission trend of COVID-19 more accurately.


Promotion of the model: The SEIR model based on 2019-nCoV can be established. The SEIR model is superior to the logistic model in trend prediction, but due to the many parameters to be considered, the calculation error is greater than the logistic model 10, 11, 12, 13, 14, 15, 16, 17, 18, 19.

A dynamic growth rate model based on 2019-nCoV can be established. The dynamic growth rate model has a good fitting effect, but has a certain error. You can also optimize on the value of r. The methods of optimizing r are: 1. Perform grid optimization; 2. Perform bipartite optimization; You can optimize on the value of K and update in real time.

After the turning point of the epidemic situation, that is, the fitting effect of the reducer and the saturation period is poor, and even a large error occurs 20, 21, 22, 23.


This work was supported by the Philosophical and Social Sciences Research Project of Hubei Education Department (19Y049), and the Staring Research Foundation for the Ph.D. of Hubei University of Technology (BSQD2019054), Hubei Province, China.


  1. 1. (2020) .
  1. 2. (2020) .[Accessed 01
  1. 3. (2020) .[Accessed 28
  1. 4. (2020) . biz=MzU0MjY4NzU5Nw==&mid=2247485726&idx=5&sn=3cd57623b677 5666b53f6b98497208e1&chksm=fb17a11ccc60280a14b139c45408d6af48184ea94aa942bbd711380b271ba38 2c6e4197a3b13&mpshare=1&scene=23&srcid=0423m9tmCjW9lEgP9Wtbju6G&sharer_sharetime=1587641 792708&sharer_shareid=b8fc3d4ec515cc6c342bf7bec53c2d7d#rd [Accessed 26 .
  1. 5. (2020) .
  1. 6. (2020) . [Accesse d On Feb 29.
  1. 7. (2020) . CommendFromMachineLearnPai2-2.& depth_1utm_source=distribute.pc_relevant.none-task-blog-BlogCommendFromMachineLearn2 .
  1. 8..[Accessed 07 Feb2020];
  1. 9. (2020) . [Accessed 07
  1. 10. (2020) . , BlogCommendFromMachineLearnPai23&depth_1utm_source=distribute.pc_relevant.none-task-blog-Blog CommendFromMachineLearnPai2-3 [Accessed 07.
  1. 11. (2020) .
  1. 12. (2020) . [Accessed 30 .
  1. 13. (2020) . National Health Commission of the People’s Republic of China.[Accessed 08 .
  1. 14. (2020) . National Health Commission of the People’s Republic of China.[Accessed08 .
  1. 15. (2020) .
  1. 16.Ming W, Huang J, CJP Zhang. (2020) Breaking down of healthcare system: Mathematical modelling for controlling the novel coronavirus (COVID-19) outbreak in. , Wuhan, China[J]. bioRxiv 12(1), 627-630.
  1. 17. (2020) Health Commission of Hubei Province.
  1. 18.Chan J FW, Yuan S, K H. (2020) A familial cluster of pneumonia associated with the2019 novel coronavirus indicating person-to-person transmission: a study of a family cluster[J]. The Lancet.
  1. 19. (2020) . National Health Commission of the People’s Republic of China.[Accessed08 .
  1. 20.Huang Chaolin, Wang Yeming, Li Xingwang. (2020) Clinical features of patient sinfected with 2019 novel coronavirus in Wuhan, China[J]. The Lancet. 395(10223), 497-506.
  1. 21.B L Haagmans, H Al Dhahiry S, C B Reusken. (2014) Middle East respiratory syndrome coronavirus in dromedary camels: an outbreak investigation[J]. Lancet Infect Dis. 14, 140-145.
  1. 22.E I Azhar, S A El-Kafrawy, S A Farraj.Evidence for camel-to-human transmission of MERS coronavirus[J]. , N Engl J Med 2014, 2499-2505.
  1. 23.Zhou P, Fan H, Lan T. (2018) Fatal swine acute diarrhoea syndrome caused by an HKU2-related coronavirus of bat origin[J]. , Nature 556, 255-258.