To research if college holidays in weeks 8, 9 and 10 led to giant outbreaks within the spring and may clarify subsequent unfold of Covid-19, we first accumulate information on college holidays throughout Europe. The primary supply on college holidays is Eurydice community11established by the European Fee, which collects yearly data on the construction of the varsity 12 months in Europe. That is then cross-referenced with different sources, see information Appendix A.
A number of datasets on the unfold of Covid-19 are used. First, we use information that has been collected and harmonized on case counts of Covid-19 on the NUTS 3 stage for quite a lot of European international locations9. The NUTS classifications are a hierarchical system used within the EU primarily based on administrative borders. NUTS 3 are small areas, NUTS 2 are bigger areas, whereas NUTS 1 are main socio-economic areas. For the primary a part of the evaluation, I exploit Covid-19 case numbers from 650 areas in 14 European international locations. Secondly, information from RKI Germany is used for Covid-19 associated deaths. See Appendix A for extra data on information and Appendix B for extra particulars on the collection of international locations.
To evaluate if the timing of faculty holidays is essential for the unfold of Covid-19, we run an ordinary OLS regression to estimate Eq. (1). On this regression we attempt to clarify the variety of instances of Covid-19 per NUTS 3 area (ln variety of instances) with a single joint dummy variable (break) for areas which have college holidays in weeks 8, 9 or 10.
$$beginaligned ln(instances)_r = beta _1 break_r + region_r + CD_c +epsilon _r endaligned$$
(1)
I run a cross-sectional regression individually for every month (11 regressions) to analyze not provided that late college vacation (weeks 8, 9 or 10) have been essential for the preliminary publicity, but additionally persistence over time. The instances are first aggregated to 7-day intervals after which to the month-to-month stage, which roughly correspond to a month. Numerous NUTS 3 particular management variables ((region_r)) from Eurostat are added. These embody a categorical variable on urbanization (three classes predominantly city, intermediate and predominantly rural), inhabitants, regional earnings, space(km sq.), median age and proportion of individuals under 14 and share above the age of 60. The inclusion of those variables controls for the cross-region demographics and importantly variation in inhabitants density when it comes to variety of inhabitants, typology (urban-rural) and geographic measurement. Errors are clustered on the NUTS 2 stage.
Recall that in Eq. (1) we embody dummy variables for areas which have a faculty vacation in both week 8, 9 or 10 (break equals 1 if area had a faculty vacation in both week 8, 9 or 10). In observe, which means that we’re evaluating areas that had a break in these greater publicity weeks to areas that had a faculty vacation break in week 7 or earlier (controlling for regional variables as famous above). As well as, we add a rustic particular dummy ((CD_c)) to the regression. As we all know, testing methods differ considerably between international locations and the inclusion of a rustic particular impact accounts for such variations. By utilizing a rustic particular dummy, we’re successfully utilizing variation inside a rustic to determine the consequences. Word that international locations with the identical college vacation profile should expertise a variation within the general stage of instances coming in to the nation, stemming from the nation particular propensity to journey overseas through the college vacation. See additionally a dialogue in Appendix C (Desk C1) on journey patterns to the identified hot-spots within the alps in February and March 2020. Because the response within the spring was principally nation particular, containment coverage shouldn’t play a big position within the relative distribution of instances inside a rustic. The nation particular fastened impact will, for instance, seize nation particular lockdowns or different containment insurance policies.
The identification technique employed makes use of the truth that college holidays are determined lengthy prematurely and the precise timing is of course exogenous to the unfold of Covid-19 in February/March 2020. As well as, you will need to spotlight two additional elements. First, through the use of nation particular results ((CD_c)) we exploit solely regional within-country variation. International locations with out variation within the timing of faculty holidays will subsequently not contribute to the estimate of (beta _1) (Belgium for instance solely had a vacation in week 9). Second, regional traits ((region_r)) are added to account for noticed regional variations (e.g. density, age profile). Therefore, we examine areas which had a vacation in week 8 or later (“handled”) to areas, throughout the identical nation, that had a vacation in week 7 or earlier than (“controls”) after accounting for noticed nation and regional stage variations. Utilizing Germany for instance, we examine areas in Bavaria (week 9) and Hamburg (week 10) to Berlin (week 6) and areas in North Rhine-Westphalia (no college vacation) after controlling for regional traits. To summarize, I argue that given the exogeneity of the precise timing of holidays, and controls for noticed variations, we’re capable of infer the position of faculty holidays on the preliminary unfold of Covid-19.
Determine 4 exhibits the OLS estimation outcomes from these eleven month-to-month regressions. We are able to see clearly that areas with a faculty vacation in week 8, 9 or 10 had a significantly greater unfold of Covid-19 in March-April. Quantitatively, the distinction between late and early college vacation areas is giant. We see that the variety of instances in March is round 60% greater for areas with a faculty vacation in both week 8, 9 or 10 (in comparison with areas throughout the identical nation with a faculty vacation in week 7 or earlier than).
A pure extension is to analyze if particular week numbers matter greater than others, as we could anticipate from the timeline offered above. We are able to alter Eq. (1) and as a substitute of a single post-week 7 dummy, embody separate week particular dummies (break8 for week 8 and many others.) as proven in Eq. (2).
$$beginaligned ln(instances)_r = beta _1 break8_r + beta _2 break9_r +beta _3 break10_r + region_r + CD_c +epsilon _r endaligned$$
(2)
We estimate the equation with OLS as earlier than utilizing the identical management variables. The coefficients for the week particular vacation break dummies are proven in Fig. 4b–d. We are able to see that the preliminary influence is strongest in areas with a faculty vacation break in week 9 (over 90%) secondly in week 10 (50–95%) and lowest in week 8 (35%). As earlier than, these outcomes are relative to areas that had a break in week 7 or earlier than. Full outcomes can be found in Tables E5 and E6. It needs to be famous that, independently and parallel to this work, one other research12 has used the variation in winter college vacation weeks to analyze coverage effectiveness and the influence on extra deaths within the spring of 2020. Equally, they discover that week 9 college vacation led to a considerable share of the surplus deaths in week 9 areas.
In some instances the varsity vacation differ inside a NUTS 3 areas (e.g. Netherlands, Denmark and Sweden). Such variation will are inclined to attenuate the outcomes offered above and bias the coefficients to zero, our outcomes could subsequently present an underestimate/decrease certain of the true impact in such instances.
One potential amplifying issue for areas with holidays in weeks 9 and 10 is that since Covid-19 had been launched to a rustic by week 8 (9) vacationers, worldwide journey is probably not wanted, solely journey to different contaminated areas throughout the nation. Therefore, home journey to newly contaminated areas with college holidays in earlier week(s) could have amplified the influence for areas with breaks in weeks 9 and 10. The unfold may also be much less clustered over time in Europe (e.g. away from ski-resorts) and journey to different locations could turn out to be dangerous at this level. Examples of such occasions embody a world enterprise convention in Boston throughout week 9 (February 26–27)13. From analyzing genome sequences, some instances of Covid-19 have been possible transmitted from Denmark to Sweden in March 20206. That is in keeping with college holidays doubtlessly being essential for the reason that breaks in Denmark are principally in weeks 7/8 whereas the receiving areas are principally in northern Sweden (week 10). The outcomes are additionally strong to including gravity variables associated to Ischgl and different associated robustness checks. See Appendix C for a dialogue.
Persistence
Above we’ve got established that faculty holidays in weeks 8, 9 and 10 are a robust indicator of huge outbreaks of Covid-19 within the spring of 2020. The second principal goal of this paper is to point out how preliminary publicity remains to be related through the fall/winter of 2020. We are able to see from the Fig. 4a that regardless of the obvious disappearance of the preliminary outbreaks, the influence re-appears after the summer time holidays. This may be seen from the re-emergence of a major vacation week dummy from September and on-wards. On common, the unfold is 30–50% greater in areas with excessive chance of preliminary publicity (vacation in week 8, 9 or 10).
Investigating the persistence individually by college vacation week, we will see a sign that the persistence is proportional to preliminary publicity, with the earliest and strongest resurgence in areas with college breaks in week 9 (40–70% greater). This means that the scale of the outbreak impacts the diploma of latent unfold in an space, driving the systematic re-resurgence of Covid-19 within the fall of 2020. Giant outbreaks will subsequently have long-lasting penalties on neighborhood unfold, since it could be troublesome to seize or suppress the underlying (latent) unfold absolutely.
The International Initiative on Sharing All influenza Information (GISAID)14 present a nomenclature system for genome sequences of SARS-CoV-2 that clusters strains/variants primarily based on their genetic relatedness into eight principal clades15. Investigating the broad clades within the GISAID database, we will see that the clade that dominated in a area within the spring 2020 tends to be the most important even after the summer time, indicative of persistence. We are able to for instance see that each within the spring and early fall the GH clade has dominated in Northern-America adopted by, however to a a lot lesser extent, the G clade. That is in stark distinction with Europe have been the GR clade dominates which accounts for a really low share of instances in Northern-America. Moreover, in Europe, G and GH clades had a big share each within the spring and after the summer time. In South America, the GR clade has dominated from the beginning. In late 2020, GISAID fashioned a brand new GK clade, consisting of the Delta variant (B.1.617.2). Word nevertheless that the pattern interval of this research ends in January 2021, which is effectively earlier than the Delta variant (B.1.617.2) began to dominate in Europe.
One could also be fearful about regional containment measures impacting these outcomes. Nonetheless, you will need to notice that such insurance policies are usually skewed to areas with excessive ranges of unfold. As proven earlier than, these are the areas that had holidays in weeks 8, 9 and 10. Regional containment insurance policies would subsequently are inclined to bias the outcomes to zero, and the persistence would subsequently be stronger if not for such insurance policies. Numerous laborious hit international locations/areas began to re-introduce stricter containment insurance policies in September and October. Whereas native authorities in Germany have autonomy in imposing restrictions, the German federal authorities put in place a trigger-based system have been native authorities have been suggested to think about imposing lockdowns if new instances went above 50 per 100 thousand residents16. Germany launched a nationwide “lockdown gentle” from the start of November and a full nationwide lockdown from mid-December. Word that two giant areas, Bavaria (week 9), and Saxony (week 8) went right into a stricter lockdown previous to the complete lockdown which got here into impact on December sixteenth. Within the Netherlands, stricter measures have been introduced on September 18th for six safety areas (of 25). All of which had (primarily) college holidays in week 9. On September twenty fifth, eight extra areas acquired the tighter measures, six of which had holidays in week 9, two in week 8. Quickly thereafter, extra nationwide measures have been launched, a partial lockdown from mid-October and a full lockdown from mid-December. This gives indications that the persistence could also be underestimated attributable to focused regional coverage in laborious hit areas. A profitable nationwide lockdown would additionally possible have a tendency to scale back the general stage of the unfold and therefore noticed regional distinction, equally to the convergence over the summer time.
Whereas the geographic footprint that the varsity holidays 2020 left behind remains to be seen within the fall/early winter, we might anticipate the systematic variations to subside as native outbreaks happen in new places over time, attenuating the preliminary publicity. Therefore, outbreaks in initially low publicity areas would additionally are inclined to result in convergence over time and decreased significance of the varsity holidays. Investigating the outcomes additional, we will see how the influence of preliminary publicity decreases in December 2020 and January 2021 just like the summer time of 2020. That is in keeping with that intensive containment measures had been in place for a considerable interval within the largest international locations within the pattern (Germany and the Netherlands, see dialogue above) and the introduction of different (doubtlessly) extra contagious strains which might naturally be unrelated to preliminary publicity.
Equally, in an ordinary epidemiological SIR mannequin, for instance, infectious people go on the virus to the prone inhabitants. Over time, as increasingly more individuals get contaminated, the inhabitants of prone people reduces, ultimately slowing down the unfold till herd-immunity is reached. In our setting, this might ultimately result in convergence between the initially laborious hit areas and people much less uncovered, as herd immunity needs to be reached earlier within the extremely uncovered areas. Nonetheless, so long as the share of individuals which might be immune is pretty low, convergence between high- and low uncovered areas would solely happen progressively. To achieve herd-immunity, a large share of the inhabitants must be resistant to Covid-19, both by vaccination or antibodies. A latest research estimated that over 60% (and as much as 90%) of the inhabitants could be wanted to succeed in herd-immunity17. A Spanish nationwide research of over 51,000 people, carried out in November 2020, confirmed nevertheless that the share of individuals with antibodies was solely 10% nationally and beneath 19% within the hardest-hit areas (see the ENE-COVID venture18 and the fourth spherical outcomes19). As Spain has skilled comparatively giant outbreaks of Covid-19 (see Fig. 1), it strongly means that the share of immune people remains to be effectively under the degrees wanted to succeed in herd immunity in each Spain and the remainder of Europe through the interval of curiosity.

Coefficient plot of the joint dummy per 30 days in graph (a) from March 2020 to January 2021. Sub-graphs (c), (b) and (d) present week 8, 9 and 10 dummies (in a single regression with out the joint dummy). The coefficients within the diagrams have been remodeled ((e(beta _1-1)) for simpler interpretation.
Covid-19 associated deaths
It’s effectively established that the variety of Covid-19 exams varies between international locations2. For our identification technique, such cross-country variation in testing is captured by a rustic particular fastened impact ((CD_c)). Regional variation inside a rustic could nevertheless influence the outcomes if the variety of exams is disproportionately greater in areas with a faculty vacation in weeks 8, 9 and 10. This might for instance be attributable to higher public entry to testing in these areas. Utilizing the varsity vacation weeks to determine areas with excessive chance of preliminary publicity is a energy of the research as we will overcome potential issues associated to restricted preliminary testing capability in some international locations (importantly not Germany) or in comparatively extra rural areas. Word additionally that if testing capability was restricted and, for instance, regional capability was reached throughout March/April this may increasingly are inclined to bias the outcomes downward because the restrict could be reached first within the high-exposure areas.
This dialogue underscores the significance of exploring the robustness of the outcomes utilizing different Covid-19 associated outcomes. To research if (within-country) regional variation in testing impacts our outcomes, we will use different outcomes which don’t depend on public entry, such because the variety of Covid-19 associated deaths. To research this, we use information from Germany. The German RKI publishes information on the variety of instances and Covid-19 associated deaths for every NUTS 3 area. Germany is especially effectively fitted to this goal attributable to geographic and inhabitants measurement, giant variety of NUTS 3 areas, wide-spread testing from early phases and variation in timing of faculty vacation. Germany had homogeneous regional guidelines for testing20 and good relative preliminary testing capability21. It’s subsequently pure to analyze it additional. We are able to see from the sub-plots of Fig. 5, that the broad patterns are the identical. Utilizing the joint dummy mannequin or separate week dummies (week 8 or week 9/10 mixed) the outcomes present the identical tendencies. Full outcomes can be found in Tables E8, E9, E10 and E11 within the Appendix.

Germany solely: outcomes for the variety of instances and deaths (joint week 8/9/10 dummy or individually week 8 and week 9(10) dummy. Sturdy normal errors clustered at NUTS 2 stage.
City or rural persistence
Urbanization is usually mentioned in relation to the unfold of Covid-19 and plenty of cities have skilled giant outbreaks (e.g. New York, Madrid, Stockholm). A priority could subsequently be that the end result are pushed by city areas with excessive preliminary publicity. To research if and what position urbanization performs for the persistence of Covid-19 we add an interplay time period for the joint vacation break dummy (week 8, 9 or 10) and our categorical variable for urbanization. The outcomes present that the diploma of persistence is stronger in extra rural areas. It needs to be famous that the stage of the unfold of Covid-19 is greater in city areas, however from the graphs we will see that the persistence associated to preliminary publicity is greater (as may be seen from a major interplay within the fall). The persistence is roughly 30% greater in intermediate city areas and over 50% greater in rural areas within the fall. This means that Covid-19 persists even in smaller, extra distant settings that skilled excessive preliminary publicity, and never solely in city settings. See Fig. 6 and Desk E7. A possible clarification could possibly be that containment insurance policies have been extra focused at city areas, and underestimated the potential persistence of Covid-19 in comparatively rural communities.

City/rural variations. Interplay of urbanization dummy and the joint week 8, 9 and 10 dummy. Outcomes are remodeled as earlier than.