|
IN SITE
|
 |
BMC Urol. 2004; 4: 4.
doi: 10.1186/1471-2490-4-4. Published online 2004 June 2.
Copyright © 2004
Devillé et al; licensee BioMed Central Ltd. This is an Open Access
article: verbatim copying and redistribution of this article
are permitted in all media for any purpose, provided this notice
is preserved along with the article's original URL.
The urine dipstick test useful to rule out
infections. A meta-analysis of the accuracy
Walter LJM Devillé,1,2 Joris
C Yzermans,3 Nico P van Duijn,3 P Dick
Bezemer,4 Daniëlle AWM van der Windt,5 and
Lex M Bouter5
1Institute for Research
in Extramural Medicine, VU University Medical Center, Amsterdam,
The Netherlands
2Netherlands Institute
for Health Services Research (NIVEL), Utrecht, The Netherlands
3Department of Family
Medicine, Public Health Division, Academic Medical Centre/University
of Amsterdam, Amsterdam, The Netherlands
4Department of Clinical
Epidemiology and Biostatistics and Institute for Research in
Extramural Medicine, VU University Medical Center, Amsterdam,
The Netherlands
5Institute for Research
in Extramural Medicine, VU University Medical Center, Amsterdam,
The Netherlands
Received September 3, 2003; Accepted June 2, 2004.
|
Background
Many studies
have evaluated the accuracy of dipstick tests as rapid detectors
of bacteriuria and urinary tract infections (UTI). The lack of
an adequate explanation for the heterogeneity of the dipstick
accuracy stimulates an ongoing debate. The objective of the present
meta-analysis was to summarise the available evidence on the
diagnostic accuracy of the urine dipstick test, taking into account
various pre-defined potential sources of heterogeneity.
Methods
Literature from
1990 through 1999 was searched in Medline and Embase, and by
reference tracking. Selected publications should be concerned
with the diagnosis of bacteriuria or urinary tract infections,
investigate the use of dipstick tests for nitrites and/or leukocyte
esterase, and present empirical data. A checklist was used to
assess methodological quality.
Results
70 publications
were included. Accuracy of nitrites was high in pregnant women
(Diagnostic Odds Ratio = 165) and elderly people (DOR = 108).
Positive predictive values were =80% in elderly and in family
medicine. Accuracy of leukocyte-esterase was high in studies
in urology patients (DOR = 276). Sensitivities were highest in
family medicine (86%). Negative predictive values were high in
both tests in all patient groups and settings, except for in
family medicine. The combination of both test results showed
an important increase in sensitivity. Accuracy was high in studies
in urology patients (DOR = 52), in children (DOR = 46), and if
clinical information was present (DOR = 28). Sensitivity was
highest in studies carried out in family medicine (90%). Predictive
values of combinations of positive test results were low in all
other situations.
Conclusions
Overall, this
review demonstrates that the urine dipstick test alone seems
to be useful in all populations to exclude the presence of infection
if the results of both nitrites and leukocyte-esterase are negative.
Sensitivities of the combination of both tests vary between 68
and 88% in different patient groups, but positive test results
have to be confirmed. Although the combination of positive test
results is very sensitive in family practice, the usefulness
of the dipstick test alone to rule in infection remains doubtful,
even with high pre-test probabilities.
|
Testing for the
presence of micro-organisms in the urinary tract, in order to
diagnose asymptomatic bacteriuria or symptomatic urinary tract
infections (UTI), is very common at all levels of health care.
UTI are a common cause of fever in young children, often accompanied
by subtle and non-specific clinical findings [1].
In a small percentage of children this may lead to kidney scarring,
and at a later age to hypertension, and even renal failure [2].
In general practice, 2–3% of all consultations, and even 6% in
the case of women, are due to symptoms suggesting UTI [3].
The prevalence of asymptomatic bacteriuria is 4–7% in pregnancy,
when it can progress to symptomatic UTI, postpartum UTI or pyelonephritis
[4,5].
Untreated bacteriuria during pregnancy has been shown to be associated
with low birth-weight and premature delivery [6].
Bacteriuria is more common with increasing age. Elderly non-institutionalised
women and men show a prevalence rate of 6 – 30% and 11–13%, respectively,
while in institutionalised elderly people the prevalence ranges
from 25 to 50% [7].
Many tests are
available for the diagnosis of bacteriuria or UTI. A (semi-)
quantitative culture of a urine specimen is the only method that
can provide detailed documentation of a bacterial urine infection.
However, making a culture is costly, and takes at least 24 hours.
An ideal test requires only limited technical expertise, is cheap
and has a high accuracy, enabling a quick diagnosis in high-risk
patients [2,8].
One example is the dipstick test, where only nitrites and leukocyte
esterase – and not proteins and blood – show fair accuracy, compared
with a quantitative culture [9].
In the past 25
years, many studies have evaluated the accuracy of dipsticks
tests as rapid detectors of bacteriuria and UTI in different
populations and age groups. Several narrative reviews have been
written [6,9-12],
and two meta-analyses [1,13]
have been performed. The meta-analysis by Hurlbut and Littenberg
[13]
did not report on sources of heterogeneity. The most recent meta-analysis
[1]
of 26 studies in children, showed major heterogeneity of diagnostic
accuracy across studies, which could not be fully explained by
differences in age, or by differences in the definition of the
criterion standard. The lack of an adequate explanation for the
heterogeneity of the dipstick accuracy stimulates an ongoing
debate. Many elements and differences in the process of urine-collection
and analysis, and in the selection of patients, may influence
the presence of micro-organisms which can be detected by the
dipstick, as well as the presence of substances that may give
false results [10,14-16].
The methodological quality of the studies might also be an important
determinant of the reported accuracy [17].
The objective
of the present meta-analysis was to summarise the available evidence
on the diagnostic accuracy of the urine dipstick test, taking
into account various pre-defined potential sources of heterogeneity.
|
Literature search
Standardised searches were conducted in 1998 and 1999 in computerised
databases (Medline and Embase), by reference tracking [ 18]
and through personal contacts with experts in the field of research.
In January 2000, the search was extended and updated by conducting
an on-line Medline search at the PubMed website http://www.pubmedcentral.gov/redirect3.cgi?&&reftype=extlink&artid=434513&iid=11309&jid=67&&http://www.ncbi.nlm.nih.gov/pubmed Table
5 [see Additional
file 1].
Two reviewers (WLJMD,
JCY) selected the studies. The following inclusion criteria were
applied: publications should concern the diagnosis of bacteriuria
or urinary tract infections, investigate the use of dipstick tests
for nitrites and/or leukocyte esterase, and present empirical data.
Excluded were studies which focused only on sexually transmitted
diseases, urethritis or schistosomiasis, studies with no accepted
criterion standard (at least semi-quantitative or quantitative
urine culture), studies which did not provide sufficient data for
the reconstruction of a diagnostic two-by-two table, and studies
which based test positivity on the combination of various other
tests jointly with tests for nitrites and/or leukocyte esterase.
Studies carried out before 1990 and studies in animals were also
excluded. There were no language restrictions. When consensus was
not reached, a third reviewer (NPD) was consulted to resolve disagreements.
Quality and applicability of studies
The checklist of the Cochrane Methods Working Group on Meta-analysis
of Diagnostic and Screening Tests was used to assess the methodological
quality of the selected studies [ 19]
(available on request). Three reviewers (CJY, NPvD, WLJMD) independently
assessed all selected publications. Disagreements were resolved in
consensus meetings.
Internal validity
criteria (IV) were scored as 'positive' (adequate methods), 'negative'
(inadequate methods, potential bias) or 'no information'. External
validity criteria (EV) were scored positive if sufficient information
was provided to assess the generalisability of the findings. Sub-totals
were calculated separately for internal validity (maximum 8) and
external validity (maximum 15 for nitrites or 16 for leukocyte-esterase),
and percentages of the maximum possible score were calculated.
Estimates are presented with 95% confidence intervals (95% CI).
Potential sources of heterogeneity
For each publication detailed information was abstracted on: the
colony count used to define UTI (cut-off used for the criterion standard),
exclusion criteria, setting, level of care, symptomatic or asymptomatic
bacteriuria, population sampled (children, general population, pregnant
women, etc), age of the study population, urine-collection procedures,
whether only first voided urine was collected, micro-organisms, procedures
followed when urine was contaminated, duration of transport of the
urine sample to the laboratory for culture, visual or automatic reading,
and person who was reading the dipstick. In addition, information
was collected on the year of study, disease prevalence at the setting,
sample-size, country in which the study was performed, brand of dipstick
and language of publication.
Meta-analysis
Data on sensitivity and specificity were derived from the original
publications. If absolute data were not presented, published sensitivity
and specificity data were used to reconstruct two-by-two tables.
Sensitivity and specificity were pooled after natural logarithmic
transformation. The average predictive values were calculated on
the basis of geometric means of sensitivity and specificity using
the weighted mean prevalence in the sub-group of studies at issue.
The diagnostic odds ratio (DOR) of each individual study was calculated
according to the following formula [ 20, 21]:

The DOR represents
the ratio of the odds of a positive test result in the diseased
group to the odds of a positive test result in the non-diseased
group. A DOR of 1 means that the test has no discriminative power.
When the DOR is more than one, the odds of a positive test result
are higher in the diseased population. Pooling of the DOR was also
performed after natural logarithmic transformation [ln(DOR)].
The statistical
heterogeneity of sensitivity, specificity and the ln(DOR) across
studies was tested by a ?2 test of independency with k-1
degrees of freedom (k = number of studies) [22].
As the validity of weighting by the inverse of the variance of
the DOR is still under debate for meta-analysis of diagnostic studies
[23],
only the results of fixed unweighted pooling are presented. Outliers
were detected by means of the Galbraith plot [24].
If a factor was significantly associated with outlying results
(according to logistic regression), all studies with that factor
were excluded from further analysis.
In case of negatively
associated pairs of sensitivity and specificity, and a homogeneous
ln(DOR), a regression line was fitted as a Summary ROC curve (SROC)
[20,25]
in a scatter plot of the various studies included, with their sensitivity
on the y-axis and (1 – specificity) on the x-axis. If sensitivity
and specificity are negatively associated, it may be assumed that
they represent a single DOR and that any variation between the
pairs is caused by the use of different cut-off points for the
test across studies. Dependency of the ln(DOR) on the cut-off point
(S) can be tested using meta-regression analysis:
ln(DOR)
= a + ßS
If pairs of sensitivity
and specificity still showed weak negative or no association, and
if sensitivity or specificity was heterogeneous, sub-group analyses
of the ln(DOR) were performed by means of ANOVA. All individual
validity criteria, and all pre-defined potential sources of heterogeneity
mentioned above, were used for sub-group analyses. Association
with continuous variables was tested in univariate meta-regression
analysis of the ln(DOR).
After sub-group
analyses, all sources of heterogeneity associated with ln(DOR)
up to p = 0.25 were selected for a multiple meta-regression analysis,
to study the presence of independent factors associated with the
ln(DOR).
Analyses were performed
with SPSS 7.5 for Windows95 and with Meta-test [26].
For a more detailed description of the model used in this analysis,
reference is made to Midgette et al. [27]
and Devillé et al [28].
The accuracy of
the dipstick test for nitrite and leukocyte-esterase was studied
both separately and in combination: positive results for either
nitrites or leukocyte-esterase or for both.
|
Literature search
The search strategy identified 220 publications, of which 70 [ 29- 98]
met the inclusion and exclusion criteria. Five selected publications
[ 94- 98]
were only detected by the search in EMBASE (n = 1), by reference
tracking (n = 1) or personal contacts (n = 3). See Table 6 [see Additional
file 2] for the main characteristics of the publications included.
150 publications were excluded from meta-analysis for the following
reasons: they did not report on the accuracy of the urine dipstick
test (nitrites and/or leukocyte esterase) for the diagnosis of
UTI or bacteriuria (n = 99), they were reviews (n = 22), they did
not use culture as a criterion standard (n = 6), they did not base
test positivity only on nitrites and/or leukocyte esterase (n =
6), or they did not include sufficient data to calculate the diagnostic
two by two table (n = 17). The 70 selected publications represent
studies from 18 countries in five continents, published in seven
languages. Two selected publications present the results of two
different studies. Therefore, 72 different studies were included,
17 of which studied nitrites only, and 2 studied leukocyte-esterase
only. The other studies evaluated different combinations of both.
Quality and applicability of studies
The mean score for internal validity was 72% (95% CI 69 to 75). Nine
publications used the culture on dipslide as a criterion standard.
Nine publications (13%) concerned double-blind studies; only two
were clearly hampered by verification bias. In 65% of the studies
the dipstick test was evaluated with the help of clinical information.
The mean score
for external validity was 69% (95% CI 65 to 73). Some outpatient
departments provided care at primary level, resulting in 15 primary
care studies (21%). 17 studies (24%) did not provide details about
the general population studied. Sixty percent of the studies did
not mention any exclusion criteria; 20% gave no information on
the way in which urine was collected, and 86% did not state whether
first-voided urine was collected. Information on mixed or contaminated
cultures was not available in over 50% of the studies (details
are available on request).
Meta-analysis
Nitrites (n = 58)
Sensitivity and specificity were poorly correlated (Spearman ? =
-0.377) and highly heterogeneous (Q = 776 and 9609, respectively,
df 57). So was the ln(DOR) (Q = 145, df 57). On the Galbraith plot,
22 studies were outside the 95% bounds (+/-2Z) from the standardised
mean ln(DOR). Univariate logistic regression revealed an association
of outliers with lower categories of internal and external validity
(internal = 50%: OR = 15.9, 95% CI 1.1 to 233.2, external = 75%:
OR = 4.2, 95% CI 1.2 to 15.1). Therefore, studies in the lowest categories
of internal or external validity (=50%) were excluded from further
sub-group analysis and meta-regression (n = 12, references: [ 36, 37, 44, 62, 71, 72, 77, 78, 82, 83, 88, 89]).
The ln(DOR) remained
heterogeneous (Q = 125, df 45). Univariate sub-group analyses revealed
statistically significant differences in the ln(DOR) between several
sub-categories of internal validity (blinding and prospective versus
retrospective data collection) and external validity (types of
patient population and care setting) (Table 1).
The ln(DOR) was
also univariately associated with the cut-off point of the dipstick
used in the evaluations (ß = -0.439, 95% CI -0.606 to -0.272),
pre-test probability (ß = -4.54, 95% CI -6.499 to -2.082) and year
of publication (ß = -0.197, 95% CI -0.197 to -0.013).
Further analysis
within sub-groups showed the following results:
• blinding: only
in double-blind studies were sensitivity and specificity found
to be highly negatively correlated (? = -0.647) with a homogeneous
ln(DOR). In unblinded studies the ln(DOR) was associated with the
cut-off point for a positive result of the dipstick, and in single
blind studies it was associated with both the cut-off point and
the general population;
• patient populations: sensitivity
and specificity were highly negatively correlated in studies involving
general populations (? = -0.539), pregnant women (? = -0.559) and
surgery patients (? = -1.00), resulting in a homogeneous ln(DOR).
In multiple meta-regression, the ln(DOR) for studies in general
populations was associated with the cut-off point of the dipstick,
supra-pubic urine-collection and automatic or visual reading. For
studies in pregnant women it was associated with the presence of
clinical information, and for studies in children it was associated
with the cut-off point of the dipstick only;
• care setting: strong
negative correlations existed between sensitivity and specificity
in family practices (? = -0.714) and emergency departments (? =
-0.400) with a homogeneous ln(DOR) in both sub-groups. In multivariate
meta-regression analysis, the ln(DOR) was associated in family
practices with the pre-test probability; in outpatient departments
it was associated with the cut-off point of the dipstick, and in
inpatient departments with the cut-off point, pre-test probability,
automatic or visual reading, and the presence of clinical information.
Multiple meta-regression
analysis of all studies revealed an independent association of
the ln(DOR) with the cut-off point of the dipstick (ß = -0.348,
95% CI -0.505 – -0.192), studies executed in pregnant women (ß =
1.082, 95% CI 0.178 – 1.985), in general populations (ß = -0.772,
95% CI -1.601 – 0.057) or in elderly people (ß = 1.457, 95% CI
0.022 – -2.882) (adjusted R2 regression model: 0.55).
For details on
sensitivity, specificity, odds ratios and predictive values, see
Table 1.
Post-test probabilities at different pre-test probabilities for
different patient populations and care settings are shown in Table 4,
and Figure 1 and 2.
Leukocyte-esterase (n = 42)
On the Galbraith plot 10 studies were outside the 95% bounds (+/-2Z)
from the standardised mean ln(DOR). Univariate logistic regression
revealed an association of outlier studies with lower categories
of external validity (external = 50%: OR = 32, 95% CI 2.3 to 447).
Studies in the lowest category (=50%) of internal validity (n = 1,
reference: [ 81])
and external validity (n= 6, references: [ 37, 62, 71, 80, 83, 89])
were excluded from the analysis. Sensitivity and specificity were
correlated after exclusion (Spearman ? = -0.635), but remained heterogeneous
(Q = 368 and 1799, df 34), as did the ln(DOR) (Q = 64, df 34).
Univariate sub-group
analyses showed statistically significant differences in the ln(DOR)
between sub-categories of external validity (Table 2):
disease (UTI versus bacteriuria), type of patient population, care
setting, method of urine-collection, reported exclusion criteria,
and brand of dipstick. The ln(DOR) was not associated with the
cut-off point for a positive leucocyte-esterase test. Further analysis
of sub-groups showed that sensitivity and specificity were strongly
negatively correlated in the non-urology studies (? = -0.798),
as well as in the two urology studies (? = -1.00) resulting in
a homogeneous ln(DOR) in the non-urology sub-group. Multiple meta-regression
analysis in the non-urology studies showed an association of the
ln(DOR) with the cut-off point of the dipstick, the disease and
the family physician reading the test, but not with setting of
care. At this level an interaction existed between disease and
family physician reading the test (adjusted R2 regression
model: 0.42). All other associations disappeared.
For details on
sensitivity, specificity, odds ratios and predictive values, see
Table 2.
Post-test probabilities for different care settings are shown in
Table 4 and
Figure 1.
Nitrite and leucocyte-esterase:
one or both positive (n = 39)
Eleven studies were outliers; low internal validity (n = 3, references:
[ 29, 36, 82])
and supra-pubic urine-collection (n = 1, reference: [ 49])
were associated with outlying results: these studies were excluded.
Sensitivity and specificity were weakly correlated (Spearman ? =
-0.227), and both remained heterogeneous. The ln(DOR) was homogeneous
(Q = 41, df 34).
The ln(DOR) was
univariately associated with the cut-off point of the criterion
standard, the availability of clinical information, population
groups and brand of dipstick (Table 3).
Sensitivity and specificity were negatively correlated in the sub-group
of the general population (? = -0.406), in children (? = -0.417),
surgery patients (? = -1.0) and urology patients (? = -0.50). Sensitivity
was homogeneous in pregnant women, surgery and urology, specificity
was homogeneous in the later two groups. The ln(DOR) was homogeneous
in all population groups.
Multivariate regression
analysis retained the following independent factors: a cut-off
point for the criterion standard of 1000 mcu/ml (1 study only, ß =
-1.823, 95% CI -3.629 – -0.017), studies in children (ß = 1.176,
95% CI 0.477 – 1.875), studies in urology patients (ß = 1.184,
95% CI 0.103 – 2.264) and the presence of clinical information
(ß = 0.893, 95% CI 0.259 – 1.527) (adjusted R2 regression
model: 0.39). The model did not change when excluding the one study
with the low criterion standard cut-off point (1000 mcu/ml).
For details on
sensitivity, specificity and odds ratios, see Table 3.
Post-test probabilities for different patient populations and care
settings are shown in Table 4,
and Figure 1 and 2.
Nitrite and leucocyte-esterase positive
(n = 14)
Four studies were outliers, of which two had low external validity.
As no factor was associated with the outliers, no studies were excluded.
Sensitivity and specificity were negatively correlated (Spearman
? = -0.275), and were both heterogeneous, as was the ln(DOR) (Q =
43, df 13).
The diagnostic
odds ratio was associated with the cut-off point of the criterion
standard and with population groups (Table 3).
It was also associated with the cut-off point of the dipstick (ß =
-0.421, 95% CI -0.071 to -2.308), because of one study [46]
that used a cut-off point of 1000 mcu/ml for the criterion standard.
Sensitivity and specificity were negatively associated after exclusion
of this study (Spearman ? = -0.36), but remained heterogeneous,
as did the ln(DOR). The ln(DOR) was only homogeneous in studies
on children (Q = 9, df 5, Spearman ? = -0.49).
In multivariate
meta-regression, the independent factors were: studies in general
populations, studies in surgery patients and one study with a criterion
cut-off of 1000 mcu/ml. When excluding this last-mentioned study,
studies in general populations (ß = -2.312, 95% CI -3.950 to -0.675)
and studies in surgery patients (ß = -2.846, 95% CI -5.435 to -0.257)
remained in the regression model (adjusted R2 regression
model: 0.50).
For details on
sensitivity, specificity and odds ratios see Table 3.
Post-test probabilities for different patient populations are in
Table 4 and
Figure 2.
|
Quality of the evidence
Before discussing the accuracy of the dipstick itself, one must
take into account the amount and quality of the available evidence.
The search was extensive, and identified a large number of studies
published during the nineties. The quality of the research, as
could be derived from the publications, was reasonable: 70% of
the selected studies had an internal validity score which was approximately
70% of the maximum score. Only one in three publications had an
external validity score of 75% or more. The importance of internal
and external validity becomes clear from the fact that low scores
were predominantly found among the outliers in this meta-analysis.
A good description of the study population, using explicit selection
criteria, is important: a major part of the existing heterogeneity
in this meta-analysis could be explained by differences between
study populations. The majority of the publications gave no information
on important factors (such as the handling of contaminated samples
or mixed cultures, or the micro-organisms cultured), which did
not facilitate evaluation.
Nitrites
Overall, the sensitivity of the urine dipstick test for nitrites
was low (45 – 60% in most situations) with higher levels of specificity
(85 – 98%). The typically low pre-test probabilities resulted in
high predictive values of negative test results. The test for nitrites
had its highest accuracy in specific populations such as pregnant
women, urology patients and elderly people. Only in the elderly
did the test for nitrites reach a high sensitivity, while in pregnant
women sensitivity was the lowest, confirming the results reported
by Patterson [ 5].
Although statistically not significant, the test for nitrites might
perform better in asymptomatic patients and in patients who are
not on antibiotics, confirming the results reported by Beer [ 14].
In multivariable
analysis the accuracy of the dipstick for nitrites was affected
only by the cut-off point for the nitrites and the population tested.
The differences between the studies with regard to implicit cut-off
points may be effected by human, instrumental or environmental
factors.
Patient populations
and care setting were highly correlated. Pre-test probabilities
differed between some levels of care. While it is often expected
that pre-test probability increases with each level of the health
care system, in this study it was found to be higher in family
physician or primary care studies, compared to hospital studies.
Family physicians apparently use the dipstick test to diagnose
an infection based on clinical signs and symptoms, while hospital-based
physicians order a dipstick test to screen patients to exclude
the presence of an infection.
Leukocyte-esterase
Sensitivity of the urine dipstick test for leukocyte-esterase was,
in general, slightly higher than for the dipstick test for nitrites
(48 – 86%), while the specificity was slightly lower (17 – 93%).
Generally, this resulted in a lower accuracy, compared to the test
for nitrites, lower predictive values of positive test results and
similar predictive values of negative test results.
The heterogeneity
of the results of the urine dipstick test for leukocyte-esterase
was only caused by factors related to external validity. Accuracy
was higher for the detection of symptomatic UTI, compared to asymptomatic
bacteriuria, as opposed to the test for nitrites. The leucocyte-esterase
test had a much higher accuracy in urology patients, and consequently
also in tertiary care, and when using a catheter for urine-collection.
Sensitivity is highest in primary care, but requires further diagnostic
work-up because of the high rates of false positives. In primary
care negative results do not exclude the presence of infection.
Combination of nitrites and leukocyte-esterase
Combining the results of both parts of the dipstick tests with one
or both showing a positive result increased sensitivity (68 to 88%),
but had different effects on specificity. The considerable false
positive rates weigh upon the predictive values of positive test
results, as reported earlier [ 10].
This resulted in different effects on accuracy, but increased the
predictive values of a negative test result in all study populations,
except studies in general populations. A negative dipstick test result
excluded the presence of infection in most studies, contrary to the
findings of Hurlbut et al. [ 13].
Accuracy was highest in urology patients, surgery patients and in
children. No differences were found between symptomatic UTI and asymptomatic
bacteriuria, as was reported by Pelgrom [ 12].
When both tests were positive specificity increased, also raising
the predictive value of a positive result to an acceptable level
in general populations.
Recommendations for practice
Care setting and patient population are the major sources of heterogeneity.
Consequently, these factors should be taken into account for optimal
test use in different clinical circumstances. In the general population
a negative test result for one of both tests has a sufficient predictive
value to exclude disease, and when both test results are positive
there is sufficient evidence to rule in infection. Also in children,
pregnant women, surgery or urology populations a negative result
for both tests rules out infection, while a positive nitrite test
still needs working-up, although the probability of infection increases
considerably. In the elderly a negative test result for both tests
rules out infection, while a positive nitrite test rules in infection.
Post-test probabilities of positive leucocyte-esterase are low in
all population subgroups.
A family physician
should take these considerations in specific population groups
into account, but in non-specific patients in a general practice
a positive nitrite test rules in infection. On the other hand,
if both tests are available and one of them is negative, confirmation
remains necessary, because of the amount of false positive results.
In other settings clinicians may exclude infection on the basis
of a single negative test result.
Criterion standard
For nitrites and leukocyte-esterase both separately or combined,
the use of a more stringent definition of infection by increasing
the cut-off point of the culture raised accuracy significantly. The
lower cut-off point, at less than 1,000 mcu/ml, used mainly in supra-pubic
urine-collection, resulted for nitrites in a higher accuracy through
higher sensitivities. The present findings do not demonstrate systematically
higher false positive rates with more stringent definitions of infection,
as was observed by Gorelick [ 1].
The lowest cut-off point had higher false positive rates, but not
the cut-off point at 10 5 mcu/ml.
|
Research in this
field can still be improved by implementing clear inclusion and
exclusion criteria, and by double-blind study designs. Reporting
on the distribution of micro-organisms, the way in which urine
is collected, the time delay between collection and analysis,
whether only first-voided urine was collected, the handling of
mixed cultures and contaminated urine samples, and who was reading
the test, may improve future systematic reviews of test accuracy.
If sample-sizes are adequate, the publication of results for
relevant sub-groups may also increase the quality of future diagnostic
studies in this field. Although this meta-analysis covers the
evidence published over the last decade, the validity of its
results is also limited by the limited specifications given in
the publications. As specific patient populations – a proxy-indicator
for spectrum of disease – seem to be the major source of heterogeneity
of accuracy, more details about patients in different clinical
settings might increase the validity of a future meta-analysis.
Overall, this
review demonstrates that the urine dipstick test alone seems
to be useful in all populations to exclude the presence of infection
if the results for nitrites or leukocyte-esterase are negative.
Sensitivities of the combination whereby one or both test results
are positive vary between 68 and 88% in different patient groups,
but positive test results have to be confirmed or pre-test probabilities
have to be high on the basis of the clinical history and/or a
combination of other tests. In family practice, the combination
of both tests with at least one positive result is very sensitive,
but because of its low specificity remains the usefulness of
the dipstick test alone doubtful, even with high pre-test probabilities.
|
|
|
WLJMD, JCY and
NPvD conducted the searches, read the papers, extracted the data
and analysed the methodological quality of the papers. Statistical
analysis was done by WLJMD, and checked by PDB, DAWMvdW and LMB.
All authors contributed equally to writing and reviewing the
paper.
|
|
|
Table 5
PubMed search
strategy for literature concerning the diagnosis of bacteriuria
or urinary tract infections by measuring nitrites and/or leukocyte
esterase with a dipstick, 1990 – 1999
Additional File 6
Studies (n
= 72)* included in the meta-analysis of the urine dipstick
for diagnosing bacteriuria or urinary tract infections by measuring
nitrites and/or leukocyte esterase.
|
We thank Ms.
Mika van der Leden for her translation of the Japanese publications.
|
- Gorelick MH, Shaw KN. Screening tests for
urinary tract infection in children: A meta-analysis. Pediatrics 1999;104:e54. [PubMed] [Free Full Text]
- Brooks D. The management of suspected urinary
tact infection in general practice. Br J Gen
Pract 1990;40:399–402. [PubMed]
- Nazareth I, King M. Decision making by general
practitioners in diagnosis and management of lower urinary
tract symptoms in women. BMJ 1993;306:1103–6. [PubMed]
- Andriole VT, Patterson TF. Epidemiology,
natural history and management of urinary tract infections
in pregnancy. Med Clin N Am 1991;75:359–73. [PubMed]
- Patterson TF, Andriole VT. Detection, significance
and therapy of bacteriuria in pregnancy. Update in the managed
health care era. Inf Dis Clin N Am 1997;11:593–608.
- Romero R, Oyarzun E, Mazor M, Sirtori M,
Hobbines JC, Bracken M. Meta-analysis of the relationship
between asymptomatic bacteriuria and preterm delivery/low
birth weight. Obstet Gynecol 1989;73:576. [PubMed]
- Wolfhagen MJHM, Hoepelman IM, Verhoef J.
Urineweginfectie bij ouderen, wat is de betekenis? [Urinary
tract infection in elderly people: what does it mean?]. Ned
Tijdschr Geneeskd 1990;134:470–2. [PubMed]
- Cochat P, Dubourg L, Koch Nogueira P, Peretti
N, Vial M. French: Urine analysis by dipstick. Arch
Pédiatr 1998;5:65–70.
- Lohr JA. Use of routine urinalysis in making
a presumptive diagnosis of urinary tract infection in children. Pediatr
Infect Dis J 1991;10:646–50. [PubMed]
- The U.S. Preventive Services Task Force.
Screening for asymptomatic bacteriuria, hematuria and proteinuria. Am
Fam Physician 1990;42:389–95. [PubMed]
- Fihn SD. Lower urinary tract infection
in women. Curr Opinion Obstet Gyneco 1992;4:571–8.
- Pelgrom J, de Maeseneer J. De dipstickmethode:
vaarwel urinesediment? [The dipstick: goodbye microscopy?]. Huisarts
Nu 1995;24:8–11.
- Hurlbut TA, Littenberg B. the Diagnostic
Technology Assessment Consortium. The diagnostic accuracy
of rapid dipstick tests to predict urinary tract infection. Am
J Clin Path 1991;96:582–8. [PubMed]
- Beer JH, Vogt A, Neftel K, Cottagnoud P.
False positive results for leucocytes in urine dipstick test
with common antibiotics. BMJ 1996;313:25. [PubMed] [Free Full Text]
- Gallagher EJ, Schwartz E, Weinstein RS.
Performance characteristics of urine dipsticks stored in
open containers. Am J Emerg Med 1990;8:121–3. [PubMed]
- Edwards A, Granier S. Packaging may lead
to false positive results. BMJ 1996;313:1010. [PubMed]
- Lijmer JG, Mol BW, Heisterkamp S, Bonsel
GJ, Prins MH, van der Meulen JH, Bossuyt PM. Empirical evidence
of design-related bias in studies of diagnostic tests. JAMA 1999;282:1061–6. [PubMed] [Full Text]
- van der Weijden T, Yzermans CJ, Dinant
GJ, van Duijn NP, de Vet R, Buntinx F. Identifying relevant
diagnostic studies in MEDLINE. The diagnostic value of the
erythrocyte sedimentation rate (ESR) and dipstick as an example. Fam
Pract 1997;14:204–8. [PubMed] [Free Full Text]
- Cochrane Methods Working Group on Screening
and Diagnostic Tests. Recommended methods. http://www.pubmedcentral.gov/redirect3.cgi?&&reftype=extlink&artid=434513&iid=11309&jid=67&&http://www.cochrane.org/cochrane/sadt.htm Accessed
21 May 2004.
- Moses LE, Shapiro D, Littenberg B. Combining
independent studies of a diagnostic test into a summary ROC
curve: data-analytic approaches and some additional considerations. Stat
Med 1993;12:1293–1316. [PubMed]
- Irwig L, Macaskill P, Glasziou P, Fahey
M. Meta-analytic methods for diagnostic accuracy. J
Clin Epidemiol 1995;48:119–30. [PubMed] [Full Text]
- Fleiss, JL. Statistical methods for rates
and proportions. New-York: Wiley; 1973.
- Irwig L, McAskill P, Glasziou P, Fahey
M. Meta-analytic methods for diagnostic test accuracy. J
Clin Epidemiol 1995;48:119–30. [PubMed] [Full Text]
- Galbraith RF. A note on graphical presentation
of estimated odds ratios from several clinical trials. Stat
Med 1988;7:889–94. [PubMed]
- Rutter CM, Gatsonis CA. Regression methods
for meta-analysis of diagnostic test data. Acad
Radiol 1995;2:S48–S56. [PubMed]
- Lau, J. Meta-test version 06. New England
Medical Center, Boston; 1997.
- Midgette AS, Stukel TA, Littenberg B. A
meta-analytic method for summarising diagnostic test performances:
Receiver-operating-characteristic-summary point estimates. Med
Decis Making 1993;13:253–7. [PubMed]
- Devillé, WL.; Buntinx, F.; Bouter, LM.;
Montori, VM.; De Vet, HC.; Van Der Windt, D.; Bezemer, PD.
Conducting systematic reviews of diagnostic studies: didactic
guidelines. BMC Med Res Methodol. p. 9. [
Free Full text in PMC]
- Semeniuk H, Church D. Evaluation of the
leukocyte esterase and nitrite urine dipstick screening tests
for detection of bacteriuria in women with suspected uncomplicated
urinary tract infections. J Clin Microbiol 1999;37:3051–2. [
Free Full text in PMC]
- Waisman Y, Zerem E, Amir L, Mimouni M.
The validity of the uriscreen test for early detection of
urinary tract infection in children. Pediatrics 1999;104:e41. [PubMed] [Free Full Text]
- Munyi ST, Macharia WM, Alwar AJ, Njeru
EK. Screening for urinary tract infection in children with
cancer. East Afr Med J 1998;75:264–7. [PubMed]
- Sharief N, Hameed M, Petts D. Use of rapid
dipstick tests to exclude urinary tract infection in children. Br
J Biomed Sci 1998;55:242–6. [PubMed]
- Shaw KN, McGowan KL, Gorelick MH, Schwartz
JS. Screening for urinary tract infection in infants in the
emergency department: which test is best? Pediatrics 1998;101:e1. [PubMed] [Free Full Text]
- Tincello DG, Richmond DH. Evaluation of
reagent strips in detecting asymptomatic bacteriuria in early
pregnancy: prospective case series. BMJ 1998;316:435–7. [PubMed] [Free Full Text]
- Zaman Z, Borremans A, Verhaegen J, Verbist
L, Blanckaert N. Disappointing dipstick screening for urinary
tract infection in hospital inpatients. J Clin
Pathol 1998;51:471–2. [PubMed] [Free Full Text]
- Edwards A, van der Voort J, Newcombe R,
Thayer H, Verrier Jones K. A urine analysis method suitable
for children's nappies. J Clin Pathol 1997;50:569–72. [PubMed]
- Hoberman A, Wald ER. Urinary tract infections
in young febrile children. Pediatr Infect Dis
J 1997;16:11–7. [PubMed] [Full Text]
- Rivierre P, Dauphin L, Lemonnier JY, Rea
C, Chavanne D, Gauvain JB. Infection urinaire en court séjour
gériatrique: intérêt de la bandelette urinaire. [Urinary
infection in geriatric short stay: value of urinary strips]. Rev
Med Interne 1997;18:765–8. [PubMed] [Full Text]
- Shaw KN, McGowan KL. Evaluation of a rapid
screening filter test for urinary tract infection in children. Pediatr
Infect Dis J 1997;16:283–7. [PubMed] [Full Text]
- Hagay Z, Levy R, Miskin A, Milman D, Sharabi
H, Insler V. Uriscreen, a rapid enzymatic urine screening
test: useful predictor of significant bacteriuria in pregnancy. Obstet
Gynecol 1996;87:410–3. [PubMed] [Full Text]
- Jellheden B, Norrby RS, Sandberg T. Symptomatic
urinary tract infection in women in primary health care.
Bacteriological, clinical and diagnostic aspects in relation
to host response to infection. Scand J Prim Health
Care 1996;14:122–8. [PubMed]
- Leanos-Miranda A, Contreras-Hernandez I,
Camacho R, Villagomez-Salcedo E, Cervantes-Gorayeb I. Rendimiento
diagnóstico de algunas pruebas en orina en las infecciones
de vías urinarias. [Diagnostic yield of various urine tests
in urinary tract infections]. Rev Invest Clin 1996;48:117–23. [PubMed]
- Osterberg E, Aspevall O, Grillner L, Persson
E. Young women with symptoms of urinary tract infection.
Prevalence and diagnosis of chlamydial infection and evaluation
of rapid screening of bacteriuria. Scand J Prim
Health Care 1996;14:43–9. [PubMed]
- Zainal D, Baba A. The value of positive
nitrites in screening asymptomatic bacteriuria amongst Malaysian
school children. Southeast Asian J Trop Med Public
Health 1996;27:184–8. [PubMed]
- Bailey BL Jr. Urinalysis predictive of
urine culture results. J Fam Prac 1995;40:45–50.
- Holland DJ, Bliss KJ, Allen CD, Gilbert
GL. A comparison of chemical dipsticks read visually or by
photometry in the routine screening of urine specimens in
the clinical microbiology laboratory. Pathology 1995;27:91–6. [PubMed]
- Mimoz O, Bouchet E, Edouard A, Costa Y,
Samii K. Limited usefulness of urinary dipsticks to screen
out catheter-associated bacteriuria in ICU patients. Anaesth
Intensive Care 1995;23:706–7. [PubMed]
- Monane M, Gurwitz JH, Lipsitz LA, Glynn
RJ, Choodnovskiy I, Avorn J. Epidemiologic and diagnostic
aspects of bacteriuria: a longitudinal study in older women. J
Am Geriatr Soc 1995;43:618–22. [PubMed]
- Nunns D, Smith AR, Hosker G. Reagent strip
testing urine for significant bacteriuria in a urodynamic
clinic. Br J Urol 1995;76:87–9. [PubMed]
- Reed RP, Wegerhoff FO. Urinary tract infection
in malnourished rural African children. Ann Trop
Paediatr 1995;15:21–6. [PubMed]
- Winkens RA, Leffers P, Trienekens TA, Stobberingh
EE. The validity of urine examination for urinary tract infections
in daily practice. Fam Pract 1995;12:290–3. [PubMed]
- Carroll KC, Hale DC, Von Boerum DH, Reich
GC, Hamilton LT, Matsen JM. Laboratory evaluation of urinary
tract infections in an ambulatory clinic. Am
J Clin Pathol 1994;101:100–3. [PubMed]
- Fowlis GA, Waters J, Williams G. The cost
effectiveness of combined rapid tests (Multistix) in screening
for urinary tract infections. J R Soc Med 1994;87:681–2. [PubMed]
- Gerber B, Schmidt H, Ohde A. Zur Diagnostik
von Harnwegsinfektionen im Wochenbett. [Diagnosis of urinary
tract infections in puerperium]. Geburtshilfe
Frauenheilkd 1994;54:524–8. [PubMed]
- Hiraoka M, Hida Y, Hori C, Tuchida S, Kuroda
M, Sudo M. Rapid dipstick test for diagnosis of urinary tract
infection. Acta Paediatr Jpn 1994;36:379–82. [PubMed]
- Villanustre Ordóñez C, Buznego Sánchez
R, Rodicio García M, Rodrigo Sáez E, Fernandez Seara MJ,
Pavón Belinchón P, Castro-Cago M. Estudio comparativo de
los métodos semicuantitativos (leucocituria,
test de nitritios y Uricult) con el urocultivo para el diagnóstico
de infección urinaria en el lactante. [Comparative study
of semi-quantitative methods (microscopy of leucocytes, nitrites,
and Uricult) with urine culture for the diagnosis of urine
infection in infants]. Anales Españoles de Pediatria 1994;41:325–8.
- Ravichandran D, Daltrey I, Uglow M, Johnson
CD. Urine testing for acute lower abdominal pain in adults. Br
J Surg 1994;81:1460–1. [PubMed]
- Anderson JD, Chambers GK, Johnson HW. Application
of a leukocyte and nitrite urine test strip to the management
of children with neurogenic bladder. Diagn Microbiol
Infect Dis 1993;17:29–33. [PubMed]
- Bachman JW, Heise RH, Naessens JM, Timmerman
MG. A study of various tests to detect asymptomatic urinary
tract infections in an obstetric population. JAMA 1993;270:1971–4. [PubMed]
- Dalton MT, Comeau S, Rainnie B, Lambert
K, Forward KR. A comparison of the API Uriscreen with the
Vitek Urine Identification-3 and the leukocyte esterase or
nitrite strip as a screening test for bacteriuria. Diagn
Microbiol Infect Dis 1993;16:93–7. [PubMed]
- Perula de Torres LA, de Borja Ranz Garijo
F, Martinez de la Iglesia J, Blanco Negrede A, Acasuso Diaz
G, Crespo Crespo A, Lechuga Varona MT, Seco Pinero MI. Validacíon
de un método de diagnóstico rápido de infección urinaria
en población escolar. [The validation of a rapid diagnostic
method for urinary infection in the school-age population]. Rev
Clin Esp 1993;192:209–13. [PubMed]
- Etherington IJ, James DK. Reagent strip
testing of antenatal urine specimens for infection. Br
J Obstet Gynaecol 1993;100:806–8. [PubMed]
- Fabre R, Baudet JM, Cavallo JD, Crenn Y,
Meyran M. Évaluation de tests rapides de dépistage dans le
diagnostic des infection urinaires. [Evaluation of rapid
screening tests in the diagnosis of urinary infections]. Pathol
Biol (Paris) 1993;41:923–6. [PubMed]
- Liptak GS, Campbell J, Stewart R, Hulbert
WC Jr. Screening for urinary tract infection in children
with neurogenic bladders. Am J Phys Med Rehabil 1993;72:122–6. [PubMed]
- Lohr JA, Portilla MG, Geuder TG, Dunn ML,
Dudley SM. Making a presumptive diagnosis of urinary tract
infection by using a urinalysis performed in an on-site laboratory. J
Pediatr 1993;122:22–5. [PubMed]
- Nauschuetz WF, Harrison LS, Trevino SB,
Becker GR, Benton J. Two rapid urine screens for detection
of bacteriuria: an evaluation. Curr Microbiol 1993;26:43–5. [PubMed]
- Ibarra OJF, Vera GE, Garcia AJL. Utilidad
de dos pruebas para el diagnóstico presuntivo rápido en infección
de vías urinarias y embarazo. [Usefulness of two tests for
rapid diagnosis of urinary infections in pregnancy]. Ginecol
Obstet Mex 1993;61:290–4. [PubMed]
- Woodward MN, Griffiths DM. Use of dipsticks
for routine analysis of urine from children with acute abdominal
pain. BMJ 1993;306:1512. [PubMed]
- Blum RN, Wright RA. Detection of pyuria
and bacteriuria in symptomatic ambulatory women. J
Gen Intern Med 1992;7:140–4. [PubMed]
- Cooper J, Raeburn A, Hamilton-Miller JM,
Brumfitt W. Nitrite test for bacteriuria detection. Br
J Gen Pract 1992;42:346–7. [PubMed]
- Graninger W, Fleischmann D, Schneeweiss
B, Aram L, Stockenhuber F. Rapid screening for bacteriuria
in pregnancy. Infection 1992;20:9–11. [PubMed]
- Hellerstein S, Alon U, Warady BA. Urinary
screening tests. Pediatr Infect Dis J 1992;11:56–7. [PubMed]
- Kumawaza J, Matsumoto T. The dipstick test
in the diagnosis of UTI and the effect of pretreatment catheter
exchange in catheter-associated UTI. Infection 1992;20:S157–9. [PubMed]
- Lachs MS, Nachamkin I, Edelstein PH, Goldman
J, Feinstein AR, Schwartz JS. Spectrum bias in the evaluation
of diagnostic tests: lessons from the rapid dipstick test
for urinary tract infection. Ann Intern Med 1992;117:135–40. [PubMed]
- Madsen OR, Faber M, Philipsen L, Frimodt-Moller
N. Påvisning af bakteriuri hos ældre indlagte patienter.
Sammenligning af kombineret leukocyt- og nitritstix med dyrkning.
[Demonstration of bacteriuria in elderly hospitalised patients.
Comparison between leukocyte and nitrite strips and culture]. Ugeskr
Laeger 1992;154:3682–6. [PubMed]
- Michie JR, Thakker B, Bowman A, McCartney
AC. Evaluation of enzyme linked immunosorbent assay for screening
urinary tract infection in elderly people. J
Clin Pathol 1992;45:42–5. [PubMed]
- Mills SJ, Ford M, Gould FK, Burton S, Neal
DE. Screening for bacteriuria in urological patients using
reagent strips. Br J Urol 1992;70:314–7. [PubMed]
- Takagi S, Arakawa S, Matsumoto O, Kamidono
S, Terasoma K, Mita T. Japanese: Usefulness of dipstick test
for determining leukocytes and bacteria in urine. Hinyokika
Kiyo 1992;38:31–6. [PubMed]
- Evans PJ, Leaker BR, McNabb WR, Lewis RR.
Accuracy of reagent strip testing for urinary tract infection
in the elderly. J R Soc Med 1991;84:598–9. [PubMed]
- Fulcher RA, Maisey SP. Evaluation of dipstick
tests and reflectance meter for screening for bacteriuria
in elderly patients. Br J Clin Pract 1991;45:245–6. [PubMed]
- Lejeune B, Baron R, Guillois B, Mayeux
D. Evaluation of a screening test for detecting urinary tract
infection in newborns and infants. J Clin Pathol 1991;44:1029–30. [PubMed]
- Shaw KN, Hexter D, McGowan KL, Schwartz
JS. Clinical evaluation of a rapid screening test for urinary
tract infections in children. J Pediatr 1991;118:733–6. [PubMed]
- Weinberg AG, Gan VN. Urine screen for bacteriuria
in symptomatic pediatric outpatients. Pediatr
Infect Dis J 1991;10:651–4. [PubMed]
- Berger SA, Bogokowsky B, Block C. Rapid
screening of urine for bacteria and cells by using a catalase
reagent. J Clin Microbiol 1990;28:1066–7. [
Free Full text in PMC]
- Ditchburn RK, Ditchburn JS. A study of
microscopical and chemical tests for the rapid diagnosis
of urinary tract infections in general practice. Br
J Gen Pract 1990;40:406–8. [PubMed]
- Goldsmith BM, Campos JM. Comparison of
urine dipstick, microscopy, and culture for the detection
of bacteriuria in children. Clin Pediatr (Phila) 1990;29:214–8. [PubMed]
- Harlass FF, Duff P, Herd M. The evaluation
of urine pH in screening for asymptomatic bacteriuria in
pregnancy. Mil Med 1990;155:49–51. [PubMed]
- Hiscoke C, Yoxall H, Greig D, Lightfoot
NF. Validation of a method for the rapid diagnosis of urinary
tract infection suitable for use in general practice. Br
J Gen Pract 1990;40:403–5. [PubMed]
- Iitaka K, Sakai T, Oyama K, Izawa T, Igarashi
S. Screening for bacteriuria in Japanese school children. Acta
Paediatr Jpn 1990;32:690–5. [PubMed]
- Lévy M, Tournot F, Ledesert B, Muller C,
Carbon C, Yeni P. Evaluation du dépistage de l'infection
urinaire par la technique de la bandelette réactive. Chez
les malades hospitalisés. [Evaluation of the reagent strip
method to detect urinary tract infections. In hospital patients]. Presse
Med 1990;19:1359–63. [PubMed]
- Lorentzon S, Hovelius B, Miorner H, Tendler
M, Aberg A. The diagnosis of bacteriuria during pregnancy. Scand
J Prim Health Care 1990;8:81–3. [PubMed]
- McGlone R, Lambert M, Clancy M, Hawkey
PM. Use of Ames SG10 Urine Dipstick for diagnosis of abdominal
pain in the accident and emergency department. Arch
Emerg Med 1990;7:42–7. [PubMed]
- Pallares J, Casas J, Guarga A, Marquet
R, Solans P, Muxi C, Ibars I, Grifell E. Evaluación de diferentes
métodos de diagnóstico rápido en la detección de bacteriuria
asintomática en la gestante. [The evaluation of different
methods for rapid diagnosis in the detection of asymptomatic
bacteriuria in pregnant women]. Aten Primaria 1990;7:547–50. [PubMed]
- Tuel SM, Meythaler JM, Cross LL, McLaughlin
S. Cost-effective screening by nursing staff for urinary
tract infection in the spinal cord injured patient. Am
J Phys Med Rehabil 1990;69:128–31. [PubMed]
- García C, Gonzáles J, Arruebarrena D, Urbieta
MA, Emparanza J, Arriola M, Aurtenetxe A, Mingo T, Areses
R. Utilidad de la tira reactiva de orina en una consulta
de nefrología pediátrica: depistaje de la bacteriuria. [Utility
of the urine dipstick at a pediatric consultation of nephrology:
| |