Methods for estimating vital rates of greater sage-grouse broods: a review

Biologists use a variety of methods to estimate productivity and resource selection of birds. The effectiveness and suitability of each method depends on the study's objectives, but is also influenced by many important traits, including detection probability, disturbance of focal birds and sampling frequency. We reviewed 504 greater sage-grouse Centrocercus urophasianus papers published from 1990 to 2019 to document the most common brood survey methods used by investigators and summarized if and how they used brood survey data to estimate brood survival and detection probability. Of the 504 papers, 16.1% (n = 81) had useful information relevant to the review. The most common methods included daytime visual surveys (46.9%; n = 38), daytime flush surveys (33.3%; n = 27), nocturnal spotlight surveys (19.8%; n = 16), radio-tagged chicks (16.0%; n = 13), wing surveys (9.9%; n = 8), brood routes (4.9%; n = 4) and pointing dogs (4.9%; n = 4). Fifty-nine of the 81 papers used >1 method, only 2 of the 81 papers measured or reported detection probability, and none reported the level of disturbance caused by the method. Studies varied widely regarding the age of the brood when brood fate was confirmed (x̄ = 44.4 days post-hatch, range 14–84 days). The frequency of brood sampling visits also varied greatly among studies (range = 1.19–3.85 surveys/brood/week) and this variation complicates comparison in fecundity and survival estimates across studies. Furthermore, 35 papers used >1 maternal behavior as purported indicators of brood fate, but none of them documented how accurate their indicators were. Future studies could reduce variance in estimates of sage-grouse fecundity and brood survival by employing empirical methods to estimate detection probability, standardizing brood sampling methods and conducting trials to document the effects of hen or brood capture, handling and flushing on brood survival estimates. Moreover, the accuracy of commonly used indicators of brood fate, including maternal behaviors, flocking behavior and distance moved after flush needs verification.

Life history determines the vital rates of plant and animal populations and, hence, measuring vital rates is essential for both basic and applied sciences. For example, vital rates form the basis of population viability models, inform decisions regarding annual harvest limits, and are often triggers for conservation actions. Similarly, resource selection studies are common in wildlife ecology and they are frequently used to inform land management decisions. Hence, estimating and comparing vital rates and resource selection of wildlife populations is a common goal of managers and researchers. Vital rates and resource selection are best if measured by using field methods that maximize accuracy and precision, mini-mize disturbance to the focal animals, and make estimates comparable across time, space, species and studies.
The need for accurate and precise methods to estimate vital rates and patterns in resource selection are particularly important in wildlife populations that are harvested and those that are rare or declining (i.e. species of conservation concern). Many birds in the order Galliformes are both gamebirds and species of conservation concern due to their declining populations, habitat loss and range contractions (Storch 2007, McGowen et al. 2012. Moreover, population growth rates for Galliformes are particularly sensitive to annual fecundity (i.e. chick survival, brood survival and other measures of productivity) (Hagen 2003, Summers et al. 2004, Sandercock et al. 2008, Taylor et al. 2012. Hence, identifying the environmental factors that affect productivity of gamebirds is necessary for effectively managing their populations. However, the methods commonly used to measure productivity and the inferences from commonly used analytic methods (Williams et al. 2002) have assumptions that are difficult to meet for many gamebird species. Specifically, detection errors are common for many avian sampling methods but are often not measured by investigators (Nichols et al. 2000, MacKenzie et al. 2002, Williams et al. 2002, Buckland et al. 2015. Moreover, incidental disturbance (i.e. flushing, radio-marking, etc.) caused by field methods is seldom quantified.
The greater sage-grouse Centrocercus urophasianus (hereafter sage-grouse) is both a gamebird and a species of conservation concern in North America due to long-term declines in abundance (Western Association of Fish and Wildlife Agencies 2015) and distribution contraction (Schroeder et al. 2004). Chick survival has a large influence on population growth rates in sage-grouse (Wisdom andMills 1997, Taylor et al. 2012) and, hence, estimating chick survival, brood survival, and the factors that affect them (e.g. resource selection) will help guide conservation and management decisions (Dahlgren et al. 2006, Atamian et al. 2010, Guttery et al. 2013, Coates et al. 2017. The methods used to estimate chick and brood survival in sage-grouse and the habitat factors that affect those parameters are the same as those used for other gamebirds (Summers et al. 2004, Steen and Haugvold 2009, Sands and Pope 2010, Dahlgren et al. 2012, Orange et al. 2016, Blomberg et al. 2019). These methods (daytime visual and flush surveys, radio-tagging, nighttime spotlight surveys, pointing dog surveys and wing surveys; Dahlgren et al. 2010a, Riley 2019 can potentially influence accuracy, precision and comparability among studies. Our objectives were to summarize the extent to which past sage-grouse studies accounted for detection probability and documented levels of disturbance caused by their field methods. To help guide future research and monitoring efforts on gamebirds, we conducted a thorough review of the methods used in past sage-grouse papers whose objectives were to estimate vital rates and habitat relationships related to sage-grouse chicks and broods.

Methods
We reviewed journal articles, theses, dissertations and book chapters (hereafter, papers) that reported one or more demographic parameters associated with sage-grouse chicks or broods habitat selection, occupancy, or movement of broods. In our review, we recorded whether papers measured hen productivity metrics, chick or brood success, or chick or brood survival. We defined productivity as the rate at which breeding-aged hens produce chicks or broods that subsequently are recruited to the population (Leopold 1933, p. 452), chick success as the number of fledged chicks per hen during a specific period of time, brood success as the number of broods that had ≥1 chick fledged per hen during a specific period of time, and chick or brood survival as the probability of chicks or broods alive during a specific period of time, respectively. We used Google Scholar (<https:// scholar.google.com/>) to conduct literature searches for papers that used sampling techniques to obtain the demographic data above, including: 1) survey individual radiotagged hens with their broods or radio-tagged chicks, 2) annual count of sage-grouse wings or tail feathers of different age and sex classes (i.e. wing surveys) and 3) counts of chicks or broods with hens on transects (i.e. brood routes) (Autenrieth et al. 1982, Connelly et al. 2003, Sedinger 2007, Hagen and Loughin 2008. We conducted three separate literature searches (for the three types of data enumerated above) and used sage-grouse, Centrocercus and Centrocercus urophasianus as keywords in all three searches. For radio-tagged hens or broods (no. 1 above), we also included the following secondary keywords: chick, brood and survival. For wing surveys (no. 2 above), we also included the following secondary keywords: hunt, harvest, female, juvenile, ratio, wing, production and productivity. For brood routes (no. 3 above), we also included the following secondary keywords: brood, survey and route. We restricted our review to papers that were published or completed from 1990 to 2019. We did not consider earlier studies because we wanted to focus on contemporary field and analytical practices. To ensure we did not overlook any papers, we searched references in previous sage-grouse reviews (Hagen et al. 2007, Taylor et al. 2012. For each paper, we recorded the following information: study duration, field method used (daytime visual surveys, daytime flush surveys, nocturnal spotlight surveys, radiotagged chicks, wing surveys, brood routes, dog surveys, etc.), number of broods sampled, number of surveys or visits per brood, survey intervals (i.e. days between subsequent visits to a brood), whether broods were intentionally flushed or detected visually (without flushing) during surveys, time of surveys (day or night), the proportion of hens or broods that were flushed during surveys, whether the study reported any effects of their survey methods on hen or brood survival, if detection probability was reported, how brood fate was determined and the response variables estimated (chick or brood survival, brood success, brood presence, brood habitat use, etc.). We assumed that researchers used a daytime flush survey if researchers explicitly reported that chicks or broods were flushed. We thoroughly examined each author's publication history for evidence that data were reused among multiple papers and we only selected papers that reused data if that paper also included additional or previously unexamined data.

Results
Our search produced 1403 results, of which we reviewed 549 papers that met our search criteria. Of those, 81 papers were useful for the review because they included brood survey methods used to estimate ≥1 vital rate associated with broods or chicks (Table 1): 47 journal articles, 31 graduate theses or dissertations and 3 book chapters. Among the 81 papers, the authors used field survey methods to: document brood habitat selection or space-use (54.3%; n = 44), estimate chick or brood success (43.2%; n = 35), estimate chick or brood survival (29.6%; n = 24), estimate an index of hen productivity (i.e. chick, brood or juvenile to hen ratios; 14.8%, n = 12) and address life history questions related to hen productivity (7.4%; n = 6). Most studies (64%, n = 52) used one survey method and 36% (n = 29) used 2 or 3 different survey methods. The most common methods were daytime visual surveys (46.9%; n = 38), daytime flush surveys (33.3%; n = 27), nocturnal spotlight surveys (19.8%;   n = 16), radio-tagged chicks (16.0%; n = 13), wing surveys (9.9%; n = 8), brood vehicle routes (2.5%; n = 2), brood walking routes (2.5%; n = 2) and pointing dog surveys (4.9%; n = 4). Fifty-nine studies (72.8%) made >1 visit per brood (Table 1). Nine papers included results from a daytime survey, but the text included insufficient information to classify whether they explicitly tried to flush the hen and her brood (i.e. whether they conducted a daytime flush survey or daytime visual survey). We assumed that researchers used a daytime visual survey (rather than a flush survey) if the text said that researchers explicitly 'located', 'observed' or 'confirmed' chicks or broods but didn't explicitly state that they flushed birds. Nighttime survey methods were primarily used to determine final brood fate (n = 10) or validate the results of daytime surveys (n = 3). Daytime flush surveys were also used to determine final brood fate (n = 10). Only 2 papers (Hausleitner 2003, Schreiber et al. 2016) used a nighttime survey method throughout the sampling period as their primary brood survey method.
Studies varied widely in numerous ways: goal(s) of the brood sampling; study design; sample size of brood or chicks included in the study; how they determined brood or chick fate; and statistical methodology used to obtain estimates from the survey data (Table 1). Of the 67 studies that monitored broods, 63 reported the number of broods monitored (i.e. sample size), which varied from 2-272 ( x = 49.7) broods. The sample size for 10 studies that reported the number of monitored radio-marked chicks varied from 25-518 chicks ( x = 185.7; Table 1). Eight other studies (those that did not include radio-marked chicks) reported chick numbers which varied from 14-1475 chicks ( x = 251.7; Table 1). Sixty-three studies reported estimates of >1 measure of any metrics related to chicks or broods but varied widely regarding when they recorded brood fate relative to days after hatch ( x = 44.4, range = 14-84 days) or date of presumed brood independence (range = 15 Jul-15 Aug). The frequency with which the same broods or chicks were re-surveyed varied among the primary survey methods used from 1.19-1.43 surveys per brood per week for daytime flush surveys, 1.62-1.92 surveys per brood per week for daytime visual surveys (without data from the obvious outlier of Huwer et al. 2008), 3.71-3.85 surveys per brood per week for radio-marked chick monitoring, and 2.25-3.50 surveys per brood per week for nighttime surveys. The study that included the most frequent revisits to broods included 168 daytime visual surveys per week over 27 days for a study designed to estimate chick diet and survival rates of humanimprinted, released (rather than wild) chicks (Huwer et al. 2008). None of the 81 studies recorded the proportion of hens or broods that were flushed during their brood surveys. Five of 12 studies that used radio-marked chicks reported mortalities caused by capture or handling, and 4 of the 12 studies reported censoring survival data when a chick's fate was unknown (i.e. went missing). Thirty-five studies used the following indirect or ancillary clues to classify brood status or fate: hen distracting behavior (57.1%; n = 20), hens observed without chicks on >1 consecutive surveys (60.0%; n = 21), the presence of other yearling or adult sage-grouse (17.1%; n = 6), or large hen movements (≥ 1-3 km) relative to prior survey(s) (11.4%; n = 4). Brood survival ranged from 10 to 100% (Table 1)  The author used their own or another author's previously examined data, but also used included previously unexamined data.
g The author used human-imprinted chicks to estimate diet selection and chick survival. Authors surveyed chicks 24 times per day thus the extraordinary survey number.
h VAR = Author quantified long-term productivity among states in the USA. Years vary.
survival from survey data varied among papers (Table 1), but 23 papers used statistical methods that required a clear binary decision on brood fate (i.e. dead or alive), 31 papers calculated apparent success, and only 1 of the 81 papers (Dahlgren et al. 2010a) provided estimates of detection probability associated with their survey method(s) (Table 1). Furthermore, 1 paper (Gibson et al. 2017) accounted for chick detection via an extension of a Cormack-Jolly-Seber model (young survival model; Lukacs et al. 2004), but the authors did not report their estimates of chick detection probability.

Discussion
Conservation and monitoring programs are most effective and efficient when they include careful consideration of factors that influence the precision and accuracy of key metrics commonly used to make management and policy decisions, such as vital rates and patterns of resource selection. We reviewed the literature on greater sage-grouse and we found substantial variation among studies in methods used; none of the sage-grouse brood survey methods and sampling designs, except wing surveys (reviewed by Connelly and Schroeder 2007) are used in consistent ways (also see Taylor et al. 2012). Some of that variation in survey methods undoubtedly reflects variation in goals of the investigators but the lack of consistency stymies comparisons across studies and populations. Based on our review of 81 papers, 71 studies have primarily used daytime visual surveys and daytime flush count surveys to estimate any metrics related to chick or brood productivity metrics of sage-grouse and nearly all did so without estimating or accounting for variation in detection probability. Indeed, we found only two studies (Dahlgren et al. 2010a, Gibson et al. 2017) that estimated or accounted for detection probability when using data from daytime surveys to estimate brood survival or chick survival, respectively. Perhaps most importantly, we found wide disparity among studies in the cues or triggers used to infer brood fate, and substantial variation among studies in the frequency and duration of surveys per brood. This variation makes comparisons among studies difficult and limits an investigator's ability to put their study results into proper context. Variation in detection probability among brood survey methods is likely most pronounced at younger brood ages and age-specific variation in detection probability may be more prominent for some methods than others (Riley et al. unpubl.). Moreover, no prior studies explicitly reported the frequency with which hens or chicks were flushed during surveys and so we currently lack estimates regarding the relative disturbance caused by the different brood survey methods. If the frequency with which hens and chicks are flushed is reported in future studies, investigators can make more informed decisions regarding which brood survey methods to use and how many repeated visits to include in their sampling protocol. And studies are needed to document the effects of repeatedly flushing hens and chicks on body condition and survival. Based on our review of the average frequency and duration of past studies, future investigators whowish to estimate brood or chick survival in a manner that maximizes their ability to compare their results to others should survey broods once per week during the first 44 days after hatch for daytime flush surveys, twice per week for daytime visual surveys, and thrice per week for radio-tagged chicks. For investigators who want to estimate chick or brood survival during early stages just after hatching (e.g. post-hatch to 14 DAH), it is important to at least acknowledge that their estimated chick survival or brood fate may be affected by disturbance caused by the methods mentioned above. Researchers who wish to estimate early brood survival may want to consider using a brood survey method that is less disruptive such as nighttime roost-site fecal surveys (Riley and Conway unpubl.). For investigators who want to calculate unmarked chick or brood success with high detection probability, the best approach is likely nighttime spotlight surveys, nocturnal roost-site fecal surveys or dog surveys (Dahlgren et al. 2010a, Riley 2019, or the use of >1 method or >1 observer. Future researchers could increase the accuracy of gamebird vital rates and patterns of resource selection by explicitly estimating and accounting for detection probability with appropriate analytic and sampling approaches. Past studies have often inferred sage-grouse brood fate based on untested assumptions regarding hen or chick behavior observed during surveys. For example, hens with broods will sometimes act 'broody' or use protective or distractive behaviors when approached by humans. However, the validity of these behaviors as reliable cues of brood status (alive or dead) needs verification especially when studies do not account for imperfect detection, rely on a single survey to determine fate, or when no chicks are seen. Hen behaviors such as staying close to the flush site (i.e. flushing short distances), feigning injury (e.g. broken-wing or wing-drag display), or rushing towards the observer have also been used as indicators of brood fate in sage-grouse (Atamian et al. 2010, Lebeau et al. 2017. Willow ptarmigan Lagopus lagopus use some of these defensive behaviors when they have broods (Sandercock 1994), but the validity of this behavior for inferring brood fate has not been quantified with any grouse species to our knowledge. Moreover, some studies have inferred brood fate based on when a hen moves >1 km between subsequent telemetry locations or in response to a flush count survey (Moynahan 2004, Dzialak et al. 2011 or when a hen flocks with other hens (Sandford et al. 2017). However, a large movement in response to flushing is not necessarily an indication of a recent brood mortality; sagegrouse hens with broods in Nevada that moved further had a lower probability of survival (Gibson et al. 2017), but hens with intact broods (even those with very young broods) in Idaho sometimes moved 1-3 km in a day (Riley and Conway unpubl.). Daily movements of willow ptarmigan broods in Norway were highly variable (typically 14-514 m per day when they were 14-21 days old), but some broods traveled >2 km in one day with no apparent effect on survival (Steen and Haugvold 2009). Hence, the use of hen behaviors to infer brood fate should be used judiciously until future research provides evidence that one or more of these behaviors are indeed reliable indicators of brood fate.
Very few prior studies on gamebirds have estimated detection probability associated with brood survey methods (Andes et al. 2012, Orange et al. 2016, Riley 2019. Chick detection probability during surveys is <100% for most gamebirds (Wing et al. 1944, Kubisiak 1978, Schroeder 1997, Dahlgren et al. 2010a, Ludwig et al. 2010. For example, chick detection probability was 0.72 (Dahlgren et al. 2010a) for 35-47-day-old sage-grouse chicks in Utah. Brood detection probability is higher than detection probability of individual chicks but is still typically <100%; brood detection probability during daytime flush counts was 0.86-0.95 for 36-47-day old sage-grouse broods in Idaho but was lower for younger-aged broods (Riley et al. unpubl.). Brood detection probability during daytime visual surveys varied with brood age for sage-grouse in Idaho, ranging from 0.618 just after hatch (95% CI = 0.440-0.770) to 0.881 at 47 days after hatch (95% CI = 0.671-0.964) (Riley et al. unpubl.). Nighttime spotlight surveys had high detection probability in Idaho (0.95-1.00) and Utah (1.00) for older sage-grouse broods (35-47 days after hatch) (Dahlgren et al. 2010a, Riley 2019), but detection rates were much lower for younger broods (1-30 day after hatch) (Foster et al. 2013, Riley 2019. We are unaware of any study that has evaluated intrinsic or extrinsic factors that influence detection probability of sage-grouse broods, regardless of the survey method. Future studies are needed on the following topics: 1) how sampling methods influence detection for other species, 2) sampling methods with minimal disturbance such as nighttime roost-site fecal surveys (Riley 2019) and thermal cameras (Andes et al. 2012) for chick or brood counts and 3) the influence of topography, vegetation, temperature, weather, observers and other factors on detection probability of brood survey methods.
Some brood survey methods for gamebirds may influence survival probabilities due to disturbance caused while flushing the hen or chicks. Intentionally or unintentionally flushing the hen or chicks could decrease survival via increased probability of chick abandonment, increased susceptibility to predation or increased energy expenditure (Riley 2019). Observers intentionally try to flush both the hen and chicks during daytime flush counts, whereas observers try not to do so during daytime visual and nocturnal spotlights counts. However, past studies vary widely regarding the frequency of surveys on individual broods. For example, observers that conducted daytime flush counts flushed hens and chicks from 1 to 14 times. Also, incidental flushes occur even with methods that are not designed to intentionally flush either chicks or the hen (Riley 2019) and not all hens and chicks flush even with methods where observers explicitly try to flush them. Future investigators can help inform others when developing future study design and protocols by quantifying the disturbance caused while conducting brood or chick surveys.
Use of radio-marked chicks or pointing dogs are viewed by some researchers as the most reliable methods to estimate chick or brood survival, but few studies have reported detection probability associated with these methods and whether detection rates vary with brood age or brood size. Pointing dogs are used extensively in Europe and are used less commonly in North America to locate chicks or broods (Connelly et al. 2003, Dahlgren et al. 2012). Pointing dogs located 21 of 22 (96%) 5-to-8-week-old sage-grouse chicks (Dahlgren et al. 2010a), but the accuracy of this method has not been tested on younger sage-grouse broods and detection probability likely varies among dogs (Connelly et al. 2003, Dahlgren et al. 2012, Orange et al. 2017, habitat conditions, trainers, weather, etc. Furthermore, pointing dogs have occasionally eaten chicks and surveyors have incidentally stepped on and killed chicks (typically <14 days after hatch) during surveys (M. Schroeder unpubl., Conway unpubl.). Standardized protocols that include efforts to reduce chick mortality are needed to better assess and reduce biases associated with pointing-dog surveys (Gutzwiller 1990, Dahlgren et al. 2012. Detection probability of radio-marked chicks (Larson et al. 2001, Burkepile et al. 2002, Gregg et al. 2007, Dahlgren et al. 2010b, Steen and Haugvold 2012 is often assumed to be 100%, but lost signals are a form of imperfect detection, and right-censoring of missing signals can produce biased survival estimates (Blomberg et al. 2019). Moreover, adverse reactions to handling or attaching transmitters to chicks may confound survival estimates (Amundson and Arnold 2010, Steen and Haugvold 2009, Taylor et al. 2012, Baxter et al. 2013, Blomberg et al. 2019. Lastly, radio-marked chicks may not accurately estimate brood survival unless researchers are certain that all chicks within a brood are radio-tagged. Wildlife agencies often rely on field methods like wing surveys and brood routes to obtain population-level information about trends in productivity and other vital rates (Hagen and Loughin 2008, Broms et al. 2010, Sands and Pope 2010, Hansen et al. 2015. We found very few papers that had used sage-grouse wing surveys to estimate productivity. Moreover, we are unaware of any study that has explicitly quantified classification error of wings which likely varies among observers based on experience, and its accuracy is likely affected by unusual molt patterns (Braun and Schroeder 2015) and wing condition. Several studies have documented the validity of grouse production indices based on wing surveys (Flanders-Wanner et al. 2004, Hansen et al. 2015 or based on a combination of wing surveys and band-recoveries (Hagen et al. 2018). Despite this, production indices are potentially biased if differential hunter kills occur among grouse sex and age classes (Zwickel 1982, Flanders-Wanner et al. 2004, Bunnefeld et al. 2009, Asymhr et al. 2012. For example, hunters may select sage-grouse hens or juveniles when they congregate near springs (i.e. density-dependent harvest selection), brood-less hens and males flush more readily than hens with broods (Riley unpubl.), adult males are viewed as trophies (D. Musil unpubl.), and these and other motivations produce sex-biased harvest (Guttery et al. 2015). Documenting sex, age or size biases associated with harvest can help ensure that those biases are accounted for when using wing surveys to estimate population parameters. We found few papers that used brood vehicle routes (n = 2) to quantify sage-grouse vital rates, despite the apparent use of this method by wildlife agencies to inform management and harvest decisions (Willis et al. 1993, Connelly and Schroeder 2007, Sands and Pope 2010. Brood vehicle routes have been used to document temporal trends in brood production (e.g. number of juveniles per hen) for numerous North American gamebirds because agencies can monitor large areas and multiple species in short time spans (Sands and Pope 2010). However, data from brood vehicle routes may not accurately estimate productivity due to: low or unknown detection rates (Hansen et al. 2015), use of convenience sampling to select the survey routes (Sands and Pope 2010), differential detection of brood-less hens or males compared to hens with broods, and detection rates may be affected by environmental factors. Additionally, we found few studies (n = 2) that used brood walking routes (i.e. line or strip transects). Brood walking routes are difficult to implement on large scales without substantial costs and, therefore, are impractical for monitoring productivity in most gamebirds.
Appropriate sampling design is needed to increase the accuracy of brood productivity parameters. Measures of annual fecundity and their relationship to habitat characteristics are likely underestimated if studies fail to explicitly account for imperfect detection of broods. Moreover, past studies varied widely regarding the brood age when surveys were conducted and this variation limits comparability among studies if detection rates or daily survival vary by brood age (and both typically do; Riley et al. unpubl.). Brood surveys are often designed to estimate survival during a biologically relevant time period (e.g. from hatch until the date when chick fate is independent of hen fate). By having a discrete time period, hypothetically, researchers can standardize the sampling frequency to conduct brood surveys. However, there is no clear consensus regarding what that time period is for broods given the contradictory evidence about when chicks can survive without a hen (Dahlgren et al. 2010b, Thompson 2012 and that age will vary among species and perhaps even among populations within species. We need more information regarding the precision and accuracy of different levels of brood survey effort to achieve defensible survival estimates for most species. Future researchers could standardize the frequency and duration of surveys per brood based on life history (e.g. when thermoregulation or flight is possible, etc.), behavior (e.g. flight versus hiding when reacting to surveyors, etc.), and length of the time when chicks are dependent on the hen. Additionally, researchers could use simulations to evaluate how much sampling effort is needed to estimate survival, resource selection or occupancy (Bailey et al. 2007) during an explicit range of brood ages.
Accurate estimates of demographic parameters, and the habitat features that affect those parameters, are essential for management (MacKenzie et al. 2002, Royle andLink 2006). Brood surveys are commonly used to estimate fecundity and resource selection of gamebirds (Ludwig et al. 2010, Anderson et al. 2015, Sandford et al. 2017, Geaumont and Graham 2020 and past investigators have used numerous brood survey methods, but very few sage-grouse studies that have used brood survey methods have measured or accounted for detection probability. Incorporation of detection probability is more common in resource selection studies including survival models that accommodate detection probabilities and resource covariates (Lebreton et al. 1992) and occupancy models that incorporate detection functions (MacKenzie et al. 2002, Royle and Link 2006, Kéry and Royle 2016.
Brood amalgamation, where >1 chick is adopted by another hen, is an alloparental care strategy common in Galliformes (Keppie 1977, Lott and Mastrup 1999, Dahlgren et al. 2010b, Steen and Haugvold 2012, Orange et al. 2016), but its frequency likely varies among populations and is rarely reported and, hence, may bias survival estimation (Flint et al. 1995). With sage-grouse, the frequency of brood amalgamation varies among populations based on a few studies that have monitored radio-marked chicks (2.0-51%) and broods (8.3-43%) (Connelly et al. 2006, Gregg et al. 2007, Dahlgren et al. 2010b, Gruber 2012. Studies that do not use radio-marked chicks to account for brood amalgamation have an unknown amount of bias introduced into their estimates of chick or brood survival. Hence, more studies are needed to estimate brood amalgamation rates and how they vary among populations, years, brood ages and species. A common theme among many analytic approaches commonly used to estimate brood survival is that they require a clear binary outcome (e.g. alive or dead) for each individual brood or chick (Andersen and Gill 1982, Dinsmore et al. 2002, Williams et al. 2002. It is unclear whether surveys without marked chicks or without complete observations can achieve this outcome; although some authors report that repeated surveys or the use of nighttime surveys is enough to validate brood fate (LeBeau et al. 2017). Given that most brood survey methods have imperfect detection, future studies can reduce variance in survival estimates by using Cormack-Jolly-Seber models (Lebreton et al. 1992, Lukacs et al. 2004, Schaub and Royle 2014, occupancy models (MacKenzie et al. 2002, Kéry andRoyle 2016), or by conducting >1 visit, using >1 method or including >1 independent observer (Williams et al. 2002, Buckland et al. 2015, Clement et al. 2017).

Conclusions and recommendations
Sage-grouse and other gamebird populations have declined (Storch 2007, McGowen et al. 2012) and future research on brood survival, hen productivity and resource selection will help inform management decisions including their response to environmental and anthropogenic conditions. The following strategies would help to improve accuracy of parameter estimates and reduce disturbance and their incorporation into gamebird brood monitoring protocols will improve inferences and comparisons across studies.
1. Document the factors that influence detection probability and extent of disturbance to broods and hens for all brood sampling methods used on gamebirds. 2. Minimize inaccuracy and imprecision in estimating brood fate and brood survival by accounting for detection probability with appropriate analytic and sampling approaches. 3. Standardize the frequency and duration of surveys per brood based on life history, behavior and length of the time when chicks are dependent on the hen. 4. Report the frequency of disturbance (e.g. flushing) to hens and chicks for each brood sampling method used. 5. Minimize flushing chicks early in life (typically < 14 days after hatch) when chicks are unlikely to survive without their hen. If estimates of early chick survival or success are needed, acknowledge that the survey method likely influences these estimates of productivity or consider using less invasive brood survey methods such as nighttime roost-site fecal surveys (Riley et al. unpubl.). 6. Conduct studies to validate the utility of using various hen behaviors as reliable indicators of brood fate (i.e. to more reliably assign fate when no chicks are detected). 7. Explicitly state the hen behaviors used to infer brood fate.