Skip to main content

Evaluating the reliability and validity of secondary reporting to measure gender-based violence in conflict and disaster



Accurately identifying the magnitude of gender-based violence (GBV) in humanitarian settings is hindered by logistical and methodological complexities. The ‘Neighborhood Method’, an adapted household survey that uses primary and secondary reporting to assess the prevalence of GBV in humanitarian settings, reduces the length of time and cost associated with traditional surveys. Primary female adult respondents disclose incidents of physical violence, intimate and non-intimate partner rape for themselves, other females in their homes (standard reporting) and other women and children in their social networks (secondary reporting). This study examines the reliability and validity of this inclusion of secondary reporting to determine the comparability of the Neighborhood Method to a traditional survey approach.


Drawing on data from 1180 women reporting on 3744 females in respondent households and 15,086 in neighboring households across four humanitarian settings (Ethiopia/ Somalia, Liberia, Sri Lanka, and Uganda), reliability of secondary reporting was measured through intra-class correlation coefficients (ICCs) and Cohen’s kappas. Validity was assessed using two-sample z-tests for differences between standard versus secondary reporting.


Prevalence estimates comparing a respondent’s household with a neighboring household show closer agreement (ICC: 0.999–0.986) than self-reports vs. secondary reporting on a female counterpoint in a neighboring home (ICC: 0.939–0.98). Kappa statistics analyzing the reliability of two separate neighbors reporting on a third neighbor showed moderate agreement beyond chance alone (κ = 0.45 for physical violence and 0.48 for rape). Prevalence rates corresponded between standard and secondary reports (i.e. showed no statistical difference) in 18 out of 24 compared populations.


For prevalence of GBV, secondary reporting about neighbors can serve as a useful adjunct to standard survey methodology. Findings offer important initial insights into the consistency and accuracy of secondary reporting as a tool for field epidemiologists in humanitarian settings.


Accurate measurement of mortality, violence, and human rights violations in conflict- and disaster-affected populations is critical for informing advocacy, program response, resource allocation, and policy. This is especially true for gender-based violence (GBV), which is known to be prevalent in humanitarian emergencies and is detrimental to the health and wellbeing of vulnerable communities. GBV comprises acts (such as physical, emotional, psychological, or sexual violence) that are perpetrated against a person’s will and are based on unequal distribution of power, particularly gender inequities and norms [1]. GBV encompasses many types of violence, including sexual assault and coercion, physical violence, and intimate partner violence (IPV).

Traditionally, the humanitarian community has relied on qualitative and numerator-based service delivery data to inform programming and policy decisions related to GBV. However, such an approach does not provide a full picture of the scope and magnitude of GBV [2]. There are myriad complexities in collecting high quality population-based data on GBV in humanitarian settings, including ongoing instability, poor access to affected populations, and limited services to support survivors [3,4,5]. In addition to these logistical complexities, there are also numerous methodological challenges to measuring GBV accurately in such settings, such as underreporting due to fear or stigma, inconsistent operationalization of key outcomes, telescoping and issues with recall of past incidents [6,7,8,9,10].

In order to capture GBV data that are as reliable and accurate as possible in disaster-affected populations, we must first consider best practices in measuring this sensitive topic more generally. There is evidence, for example, that data quality is improved by making GBV the exclusive focus of a survey as opposed to embedding questions into broader surveys about reproductive or mental health [11]. Similarly, we know that survey instruments need to be tested and adapted to safely and adequately elicit incidents of GBV in different contexts. Additionally, survey instruments can address barriers like recall and telescoping by using a shorter recall period and identifying important local or national landmark events to help respondents identify when an incident took place [8, 12]. Matching interviewers based on gender and ethnicity has been shown to foster greater trust and rapport between participants and interviewers [12], and allowing for a longer interview schedule can similarly help to build trust and rapport [13]. Importantly, using a conversational, supportive, and non-judgmental style of interviewing can promote participants’ comfort disclosing sensitive information in a way in which they feel supported. For example, slow non-judgmental interviews with male and female couples produced highly consistent reporting of domestic violence among refugees in Jordan [14]. Self-reporting with a tablet or similar electronic device appears to have worked well in some settings [15]. Finally, and most critically, survey teams must consider and work to ensure participants’ safety at every stage of data collection [16].

While there is a reasonably developed body of evidence on good practice around fostering safe and valid disclosure of GBV in survey research, less well documented in the GBV literature are good practices related to sampling approaches. Acknowledging that numerator-based approaches are limited in their ability to ‘tell the whole story’, researchers often resort to the option of an expensive, time-intensive and logistically complex population-based sampling approach. Research into alternative sampling methods better suited to conflict and disaster settings are only beginning to emerge.

One promising approach is secondary reporting (sometimes called indirect sampling or proxy sampling), in which information is systematically gathered about ‘clusters’ of individuals from a respondent who hypothetically knows about the experiences of these other individuals [17]. Secondary reporting offers several potential advantages, including faster and more cost-effective data collection, increased sample size through a single interview, the opportunity to spend more time per interview with a respondent, which in turn reduces non-disclosure bias, and the ethical advantage of limiting the number of interviewees potentially exposed to further trauma or violence triggered by an interview [16]. At the same time, secondary reporting relies on a critical assumption: that informants can and will provide complete and accurate information about the experiences of others [10].

Our own previously published work on GBV employed secondary reporting in internally displaced persons (IDP) camps in Uganda [17], conflict-affected communities in Liberia [2], Somali refugee camps in Ethiopia [18], and conflict and tsunami-affected populations in Sri Lanka [19], but with limited attention to the reliability and validity of the data in comparison to standard self-report. In this article, we explore women’s knowledge and disclosure patterns about experiences of violence and examine whether secondary reporting can assess the magnitude of GBV in a valid and reliable way in conflict and disaster-affected settings.



This analysis uses survey data from 1180 women reporting on 3744 females in respondent households and 15,086 females in neighboring households across the four humanitarian settings named above. Multi-stage cluster sampling was used to select primary respondents for each of the four studies [2, 17, 18, 20]. A trained interviewer approached a selected house and asked to speak to the female head of the household. She explained the purpose of the interview, its anticipated duration, the assurance of anonymity and the need for privacy. If the woman identifying as female head of household gave her informed consent, the interview began in a private location chosen by the respondent. If the woman refused or was unable to speak to the interviewer privately, the interviewer thanked her for her time and moved to the next house identified by the sampling procedure.

Study design

The Neighborhood Method is a population-based approach to measuring GBV that is based on a random sample of adult women reporting on their own experiences of GBV as well as the GBV experiences of others within their social networks [17]. This method was first adapted from the Sisterhood Method, a method for measuring maternal mortality using secondary reporting [21], and our study populations were further adapted in each of our study location sites based on learning from previous sites. Interviewers asked adult female respondents about their own GBV history (standard or self-report) and the experiences of their counterparts in the closest neighboring households (secondary report). In addition to asking about a neighboring adult female, interviewers in Liberia, Ethiopia, and Sri Lanka also asked about the experiences of all other women and children living within the respondents’ own household and the neighbor’s household (See Table 1). Technically, reporting of the prevalence of violence among children or women other than the respondent in the household is a form of secondary reporting, as the respondent is reporting on the experience of others. However, reporting about other members of a respondent’s household is a generally accepted survey methodology for largescale surveys like the Demographic Health Survey and Multiple Indicator Cluster Survey [22,23,24]. This study makes a distinction between this standard approach and the innovative approach of secondary reporting about the experience of other children and women living in a neighboring household. In Uganda, respondents were additionally asked to report on the experiences of their sisters.

Table 1 Summary of methodological features across the four studies

After receiving informed consent from participants, trained local interviewers used a standardized protocol to ask respondents basic questions about their household demographics and those of their closest neighbors (as identified by the interviewer to eliminate potential bias). Interviewers then asked an open-ended question about the ‘biggest challenges facing women and girls in their community’. This question often resulted in respondents initiating a discussion on the topic of GBV and would be prompted later in the interview if not spontaneously raised. The study team drew upon the strengths of qualitative and quantitative methods by using a slow and semi-structured interview schedule to allow for enhanced trust-building between the interviewer and the respondent. The survey instrument measured three distinct forms of GBV: intimate-partner rape (defined as sexual intercourse, or attempted sexual intercourse, without consent by a husband or intimate partner), non-intimate partner rape (defined as sexual intercourse, or attempted sexual intercourse, without consent by someone other than a husband or intimate partner), and physical violence (defined as any act of non-sexual action that resulted in physical harm and was committed with the intent to do harm) [25]. Interviewers were trained to probe on incidents to ensure that they met the case definitions for GBV and used a bounded recall period from 1 year to 18 months. To reduce problems with telescoping, interviewers used important local or national landmark events to help respondents more accurately place the date of their experiences. Survey questions were designed to take on a conversational interview format interwoven with systematic questions about experiences of the population of interest to ensure consistency.

Reliability and validity of secondary reporting

To examine the consistency of patterning and of overall rates of prevalence between primary and secondary reporting, we made three comparisons: incidence of violence self-reported by the respondent vs. (i) incidence reported by the respondent about her neighboring female head-of-household and (ii) incidence reported by the respondent about her sisters (Uganda study only). We also compared (iii) incidence of violence reported by the respondent about other women and children living in her own household vs. incidence reported by the respondent about other women and children in her neighbor’s household.

We assessed the reliability of secondary reporting across all study groups by calculating an intra-class correlation (ICC) coefficient. The coefficient was used to determine the amount of measurement error between secondary reporting using the Neighborhood Method and traditional self-reporting, and to gauge the extent to which secondary reporting could replace or reliably supplement traditional self-reporting. In this situation, we were interested in looking at overall consistency of patterning. To determine which variation of the ICC coefficient constituted the most appropriate measure, we used several pieces of information: for the comparison of self- and secondary reporting, a two-way analysis of variance for the prevalence of gender-based violence was deemed appropriate. The methods (i.e. secondary reporting and self-reporting) are considered “fixed” effects as they are the only methods of interest in this report [26]. The unit of analysis used were individual ratings. The two-way mixed, single measures intra-class correlation coefficient ICC [1, 3] is the best-suited coefficient for reliability analysis; an ICC between 0.9 and 1.0 is evidence for high reliability of the secondary reporting method compared to the self-report gold standard [26], and was used as our standard for measuring high reliability between secondary reporting and self-reporting.

In addition to examining the overall reliability of secondary reporting using the ICC, we also examined reliability at the level of individual interviews using additional data collected in the study of Somali refugees in Ethiopia. In that study, the research team conducted 23 ‘matched’ interviews in which two neighbors were asked to report on the experiences of violence for two common neighbors (Table 2). Cohen’s kappa was calculated to measure the agreement between the matched interviews beyond chance alone, thereby assessing the degree to which one respondent reporting on violence amongst her neighbors agreed with another respondent reporting on violence amongst the same neighbors.

Table 2 Cohen’s kappa statistics for 23 matched interviews about two common neighbors in Kebribeya Camp in Ethiopia

We assessed the validity of the secondary reporting method by comparing results on incidences of violence from this new method with results from self-reporting and reporting about the respondents’ own household. Due to the underreported nature of GBV, it is difficult - even with traditional self-reporting methods - to confirm the extent to which a survey measure ascertains its true prevalence. Without a true ‘gold standard’, we utilized self-report of GBV as a proxy measure. Unlike above, where the analysis explored the consistency of patterning, this analysis assessed correspondence in rates of prevalence between primary and secondary reports. We performed a two-sample z-test to examine differences between proportions and assessed whether the reported prevalence of violence was different between primary and secondary reporting for each study sample and for each form of violence. If standard and secondary reported prevalence failed to show statistically significant differences at the 5% level, secondary reporting was considered to indicate sufficient correspondence in comparison to self-reporting or reporting about respondents’ own household.


Reliability of secondary reporting

Table 1 presents the prevalence of self-reported and secondary-reported GBV (physical violence, intimate partner rape, and non-intimate partner rape). Using reported prevalence from all study groups, ICCs were calculated for each category of violence and for each of the three reporting population comparisons of interest (Table 3). ICCs for the comparison of respondent vs. neighbor head-of-household and for respondent household vs. neighbor household were generally high across all forms of violence, with all ICCs greater than 0.9, suggesting high reliability between secondary reporting and self-reporting [26]. This indicates that secondary reporting by neighbors is approximately as consistent as self-reporting in ascertaining experiences of GBV in households, and that it is consistent in its identification of households with higher or lower rates of violence. For the respondent vs. sisters comparison, however, reliability was poor, with all ICCs under 0.9, indicating that secondary reporting from sisters was less robust for reporting prevalence of GBV as self-reporting. This low ICC for sisters was likely affected by limited data, as comparison data for sisters was only collected at the Uganda sites. Additionally, reports for sisters may have been lower due to social or physical distance.

Table 3 Prevalence of self-reported and secondary reported gender-based violence and results of two-sample z-test for differences in prevalence by reporting population

Finally, when ICCs were calculated in sub-group analyses for adult women ≥18 years of age and girls < 18 years of age, a lower ICC of 0.722 was found for reports of rape perpetrated against girls in a respondent’s household vs. a neighboring household as shown in Tables 4 and 5. To assess reliability at the level of individual interviews, 23 matched interviews were performed in Kebribeya Camp in Ethiopia, wherein two neighbors were asked to report on a third, common neighbor. Forty-two out of 74 reports of physical violence within the recall period (57%) ‘matched,’ or were simultaneously reported by two neighbors about the same third neighbor. A total of 35 incidents of rape were reported among the matched interviews. Fifteen of these 35 incidents of rape (43%) were reported by both neighbors about the same third neighbor. Cohen’s kappa statistics for both physical violence and rape (Table 6) show statistically significant (p < 0.001) ‘moderate’ agreement between matched neighbor reporting, suggesting, in this case, moderate reliability or consistency in patterning between both secondary reporting and self-reporting [27].

Table 4 Intra-class correlation coefficients for comparisons of three different types of secondary reporting vs. standard reporting among women (≥ 18 years of age)
Table 5 Intra-class correlation coefficients for comparisons of three different types of secondary reporting vs. standard reporting among girls (< 18 years of age)
Table 6 Intra-class correlation coefficients for comparisons of three different types of secondary reporting vs. standard reporting

Validity of secondary reporting

Table 1 presents the results of two-sample z-tests for proportions to assess overall correspondence in rates of prevalence i.e. whether reported prevalence of violence was significantly different between reporting populations. The results of this analysis were mixed. For secondary reporting about neighbor head-of-household compared to self-report, all but one study sample were found to have significantly lower secondary-reported rates of violence. This finding was observed for all three forms of GBV across three different study settings. Non-intimate partner rape assessed in Uganda showed no significant difference in the prevalence reported about neighbors compared to the self-reported prevalence.

For secondary reporting about neighboring households compared to respondents’ households, there were no significant differences for 18 out of 24 such comparisons, suggesting more consistent correspondence in rates of prevalence.

Of the six comparisons of respondents’ households vs. neighbors’ households that did show statistically significant differences in GBV, two study samples (involving physical violence against adult women in Ethiopia and non-intimate partner rape against adult women in Sri Lanka) had lower prevalence of violence reported in the neighbors’ households. For the remaining four study samples (two of physical violence against women and girls in Liberia, one of non-intimate partner rape in Liberia, and one of rape of girls in Ethiopia), higher prevalence of violence was reported in the neighbors’ households compared to the respondents’ households.

Finally, secondary reporting about respondents’ sisters (data collected in Uganda only) yielded statistically significantly lower prevalence rates than self-reporting for physical violence and rape by an intimate partner. No statistically significant difference was found for the less frequent reporting of non-intimate partner rape.


With the lack of any clear basis for establishing a ‘gold standard’ of prevalence of sexual violence and clear potential for risks associated with reporting in insecure environments, surveys of violence against women in humanitarian settings are widely seen as likely to involve under-reporting. This limitation has also been documented in wealthier countries [12], and under-reporting was indicated in our one study where we asked different women about violent events in the same neighboring household. This suggests that the frontier of advancing the science of documenting GBV may not involve having a perfect survey method that captures all events. Instead, the objective perhaps should be to have complete enough documentation to understand the magnitude and various kinds of violence occurring in a specific setting and to record it with a reproducible process that will document changes over time. This approach of monitoring that misses some cases but captures patterns and eruptions has served the polio and smallpox eradication programs well [28, 29]. Similar patterning was also noted with the original use of the Sisterhood Method [30, 31], and a recent attempt to use lot quality assurance sampling (LQAS) to measure GBV in emergencies [32]. To this end, the Neighborhood Method seems to perform well compared with the standard household survey.

The results presented above illustrate that ICCs showed a generally high level of consistency in identifying individuals and households at higher and lower risk for GBV. However, in terms of estimated prevalence of GBV based on primary and secondary reports, there is clear variation with respect to the reporting population that is being addressed. Amongst women reporting on themselves and their neighbors, secondary reporting on neighbors generally resulted in a lower estimate of prevalence than self-report. In contrast, prevalence estimates based upon secondary reports of GBV in neighbors’ households and reports of GBV within the respondents’ household showed much higher levels of statistical correspondence.

The lower prevalence estimates for neighbors vs. selves may represent a respondent’s lack of knowledge about her neighbor’s experience with GBV or a bias against disclosing information about one’s neighbors. Although it is theoretically possible that the higher incidence from self-reporting could reflect that the standard self-reporting approach may be biased towards over-reporting, this is unlikely given the literature showing that GBV tends to be underreported due to stigma and other negative repercussions for the survivor [3, 7, 8].

For the comparisons between a respondent’s household and prevalence reported about the neighbor’s household, we note that the standard approach is in fact a form of secondary reporting, as respondents are asked to report on other women and children living in their own household. Data about the respondent’s household is similarly based on the assumption that an adult female has complete and accurate information about the women and children living under her care. This information is, therefore, also subject to similar biases of knowledge, non-disclosure, and social desirability as the respondent likely views herself as being responsible for the wellbeing and safety of others in the same household. Reporting about one’s household, however, still reflects a common and standard approach for assessing population health in conflict and development settings, and we thus compare it with the novel approach of asking about the respondents’ neighbor’s household. For comparisons between ‘standard’ secondary reporting about respondents’ households vs. novel secondary reporting about neighboring households, statistical tests for the most part failed to detect any significant difference in the prevalence of GBV, suggesting correspondence in overall rates of prevalence and that secondary reporting about neighboring households may be as valid as reporting about respondents’ own households on GBV.

Additionally, in cases where there was less consistency in patterning between secondary reporting and self-reporting, higher reported rates of violence in neighboring homes for children suggest that this novel type of secondary reporting may foster higher rates of disclosure, especially if social desirability bias prevents females from reporting events in their own households. These findings raise the potential that secondary reports on neighbor’s children may be more reflective of the truth than self-report on children in the respondent’s household. If supported by additional testing, this finding could have important implications for measuring violence against children – a growing measurement trend – and suggests that secondary reporting has the potential to reveal better data for younger populations than current assessments.

Taking both reliability and validity into consideration, overall findings suggest similar levels of reporting between the Neighborhood Method and standard self-reporting when looking across household data. The possibility exists that the consistency between self-reported household rates vs. neighbor rates involves under-reporting on both populations through true limitations of knowledge or reluctance on the part of the interviewee. There was little data available for us to assess the use of secondary reporting on respondents’ sisters’ experiences, as we did not use this form of reporting outside of Uganda. One barrier to collecting this data in other settings is the low likelihood of sisters knowing about each other’s experiences, especially for sensitive topics such as GBV, where chronic civil unrest and large population movements may limit communication and such intimate knowledge. Other factors that may influence the variability in patterns of difference between standard and secondary reported GBV incidence include the geographic distribution of households, where rural households in some settings may be too dispersed for respondents to accurately know their neighbors’ experiences, and cultural norms in different populations that are relevant to disclosure of GBV.

Limitations of this analysis include the lack of a true ‘gold standard’ for validity testing, such that self-reporting methods as means of measuring GBV incidence is not itself definite. We are thus limited to conducting validity testing for the non-inferiority of secondary reporting compared to the usual, standard approach. While our primary and secondary samples were powered to compare prevalence rates, we acknowledge that other factors such as internal variation and larger confidence intervals will also factor into whether our comparisons showed significance. Finally, the four studies included in this paper all focused on gender-based violence, thus limiting the generalizability of the results to understand the use of secondary reporting for other population health concerns.


In a humanitarian culture driven by an imperative to deliver assistance, often at the expense of rigorous assessment and evaluation, alternative measurement approaches better suited to contexts of war and disaster are needed. Without some rate-based measure of GBV, trends or assessments of preventative measures will not be easily evaluated. This analysis offers important initial insights into the reliability and validity of secondary reporting as a tool for field epidemiologists in humanitarian settings. Further exploration of secondary reporting will strengthen our understanding of whether and when secondary reporting is a viable alternative or supplement to standard methods.

Availability of data and materials

The datasets used and/or analyzed during the current study are available from the corresponding author upon reasonable request.



Gender-based violence


Intra-class correlation


Intimate partner violence


  1. UNHCR. Sexual and gender based violence: UNHCR; 2016. Available from: Cited 2019 Aug 13.

  2. Stark L, Warner A, Lehmann H, Boothby N, Ager A. Measuring the incidence and reporting of violence against women and girls in liberia using the “neighborhood method”. Confl Health. 2013;7(1):20.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Heise L, Ellsberg M, Gottmoeller M. A global overview of gender-based violence. Int J Gynecol Obstet. 2002;78(S1):S5–14.

    Article  Google Scholar 

  4. Checchi F, Roberts L. Documenting mortality in crises: what keeps us from doing better? PLoS Med. 2008;5(7):e146.

    Article  PubMed  PubMed Central  Google Scholar 

  5. Stark L, Sommer M, Davis K, Asghar K, Baysa AA, Abdela G, et al. Disclosure bias for group versus individual reporting of violence amongst conflict-affected adolescent girls in DRC and Ethiopia. PLoS One. 2017;12(4):e0174741.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Van Dijk J, Meyhew P. Criminal victimization in the industrial world: key findings of the 1989 and 1992 international crime surveys. In: Underst Crime Exp Crime Crime Control; 1993. p. 1–50.

    Google Scholar 

  7. Koss MP. Detecting the scope of rape: a review of prevalence research methods. J Interpers Violence. 1993;8(2):198–222.

    Article  Google Scholar 

  8. Koss MP. The under detection of rape: methodological choices influence incidence estimates. J Soc Issues. 1992;48(1):61–75.

    Article  Google Scholar 

  9. Babbie E. Survey research methods. 2nd ed. Belmont: Cengage Learning, Inc; 1990. Available from:

    Google Scholar 

  10. Silva R, Price M. Trade-offs in using indirect sampling to measure conflict violence. JAMA. 2011;306(5):547–8.

    Article  CAS  PubMed  Google Scholar 

  11. Ellsberg M, Jansen HA, Heise L, Watts CH, Garcia-Moreno C. Intimate partner violence and women’s physical and mental health in the WHO multi-country study on women’s health and domestic violence: an observational study. Lancet. 2008;371(9619):1165–72.

    Article  PubMed  Google Scholar 

  12. Russell D, Bolen R. The epidemic of rape and child sexual abuse in the United States: SAGE Publishing; 2000. p. 336. Available from:

  13. Russell DE. The prevalence and incidence of forcible rape and attempted rape of females. Victimology. 1982;7(1–4):81–93.

    Google Scholar 

  14. Khawaja M, Tewtel-Salem M. Agreement between husband and wife reports of domestic violence: evidence from poor refugee communities in Lebanon. Int J Epidemiol. 2004;33(3):526–33.

    Article  PubMed  Google Scholar 

  15. Jewkes R, Sikweyiya Y, Morrell R, Dunkle K. Gender inequitable masculinity and sexual entitlement in rape perpetration South Africa: findings of a cross-sectional study. PLoS One. 2011;6(12):e29590.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. WHO ethical and safety recommendations for researching, documenting and monitoring sexual violence in emergencies. Geneva: World Health Organization (WHO); 2007. Available from: Accessed 1 July 2020.

  17. Stark L, Roberts L, Wheaton W, Acham A, Boothby N, Ager A. Measuring violence against women amidst war and displacement in northern Uganda using the “neighbourhood method”. J Epidemiol Community Health. 2010;64(12):1056–61.

    Article  CAS  PubMed  Google Scholar 

  18. Parcesepe A, Stark L, Roberts L, Boothby N. Measuring physical violence and rape against Somali women using the neighborhood method. Violence Against Women. 2016;22(7):798–816.

    Article  PubMed  Google Scholar 

  19. Rogers B, Anderson L, Stark L, Roberts L. Estimating the incidence of physical and sexual violence against children and women in Trincomalee District, Sri Lanka: Save the Children; 2009. Available from:

  20. Warner A. Incidence of violence against women and girls in Liberia: a quantitative study using the “neighborhood method”: International Rescue Committee and the Program on Forced Migration and Health, Mailman School of Public Health, Columbia University; 2007. Available from:

  21. Deaux E, Callaghan J. Key informant versus self-report estimates of health-risk behavior. Eval Rev. 1985;9(3):365–8.

    Article  Google Scholar 

  22. Amowitz LL, Reis C, Lyons KH, Vann B, Mansaray B, Akinsulure-Smith AM, et al. Prevalence of war-related sexual violence and other human rights abuses among internally displaced persons in Sierra Leone. JAMA. 2002;287(4):513–21.

    Article  PubMed  Google Scholar 

  23. Home - UNICEF MICS. Available from: Cited 2020 Apr 12.

  24. The DHS Program - Quality information to plan, monitor and improve population, health, and nutrition programs. Available from: Cited 2020 Apr 12.

  25. Stark L. From incidents to incidence: measuring sexual violence amidst war and displacement [Public Health]. New York: Columbia University; 2010.

    Google Scholar 

  26. Shrout PE, Fleiss JL. Intraclass correlations: uses in assessing rater reliability. Psychol Bull. 1979;86(2):420–8.

    Article  CAS  PubMed  Google Scholar 

  27. Fleiss JL. Statistical methods for rates and proportions. 2nd ed. New York: Wiley; 1981.

    Google Scholar 

  28. Blake IM, Chenoweth P, Okayasu H, Donnelly CA, Aylward RB, Grassly NC. Faster detection of poliomyelitis outbreaks to support polio eradication. Emerg Infect Dis J. 2016;22:3 CDC. Available from: Cited 2020 Jun 19.

    Article  Google Scholar 

  29. Lane JM. Mass vaccination and surveillance/containment in the eradication of smallpox. In: Plotkin SA, editor. Mass vaccination: global aspects — progress and obstacles, Current topics in microbiology and immunology. Berlin, Heidelberg: Springer; 2006. p. 17–29. Cited 2020 Jun 19.

    Chapter  Google Scholar 

  30. Graham W, Brass W, Snow RW. Estimating maternal mortality: the sisterhood method. Stud Fam Plan. 1989;20(3):125–35.

    Article  CAS  Google Scholar 

  31. World Health Organization, Department of Reproductive Health and Research. The sisterhood method for estimating maternal mortality: guidance notes for potential users: World Health Organization (WHO); 1997. p. 28. Report no.: WHO/RHT/97.28. Available from:;jsessionid=93A83FFC2272660A857FE9A69F071A73?sequence=1. Cited 2020 Jun 19.

  32. Murphy M, Ovince J, Registe PPW, Jean-Claude U, Contreras M. Small sample size surveys for GBV programming: main results report: Global Women’s Institute at George Washington University, Institut de Formation de Sud; 2018. p. 22. Available from:

Download references


Not applicable.


Not applicable.

Author information

Authors and Affiliations



LS, LR, and AA led the study design, data collection, and interpretation of the analysis. LS led the writing of the manuscript. TMT substantially contributed to the data manuscript draft. GY led the data analysis and reviewed the manuscript. AN proofread and edited the manuscript. All authors read and approved the final version of the manuscript. LS had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.

Corresponding author

Correspondence to Lindsay Stark.

Ethics declarations

Ethics approval and consent to participate

Ethical approval was provided by Columbia University Medical Center’s Institutional Review Board. (AAAB7134).

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Stark, L., Roberts, L., Yu, G. et al. Evaluating the reliability and validity of secondary reporting to measure gender-based violence in conflict and disaster. Confl Health 14, 57 (2020).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: