INTRODUCTION
To detect caries lesions as early as possible is one of the main focuses for the current management of caries disease. At early stages, these lesions are non-cavitated, asymptomatic and difficult to detect clinically to the naked eye (Nyvad, 2004). In visualtactile examination, carious lesions are detected by inspection using a probe and a dental mirror. This method is quick, easy and affordable; however, for proximal areas this can be challenging due to may be masked by contact with the adjacent teeth (Haak et al., 2002). In addition, caries lesions can extend into the dentin without causing a visible cavitation (Akpata et al., 1996). Therefore, an accepted method for proximal caries detection involves a combination of visual-tactile examination and radiographic examination, with bitewing radiographs for assessment of proximal lesions (Pitts, 1996; Rindal et al., 2010).
It is of paramount importance that dental students be taught the skills to interpret bitewing radiographs. In addition, it is expected that the interpretation of dental students and dentists with greater clinical experience be as accurate and consistent as possible. However, similar to other diagnoses, the interpretation is subject to error and may lead to over- or under-treatment. Previous studies have reported wide variability of diagnostic accuracy for detection of caries on radiographs, and indicated that education and experience play a role in improving caries detection, however failed to achieve an acceptable range (Diniz et al., 2010; Shams et al., 2016). It is also known that the reliability of interpretation of dental radiographs may be affected by the education, training and experience of the observers, the quality of radiographs, the viewing environment and examiner knowledge of the subject matter (Valachovic et al.,1986; Rohlin et al., 1991; Stheeman et al., 1995). Therefore, investigate the effect of training and experience is interesting in bitewing radiograph assessment.
To the best of our knowledge, studies on the effects of clinical experience on diagnoses performance for proximal caries detection are scarce. Therefore, the purpose of this study was to investigate the accuracy and inter observer reliability of examiners with different levels of clinical experience. The hypothesis of this study is that the clinical experience of the examiners improves their accuracy and inter examiner reliability in detecting radiographic proximal caries lesions.
MATERIAL AND METHOD
Test image. Bitewing radiography set of an adult patient (male, 38-year-old) at the Operative Dentistry Clinic, University of Chile, was selected. The patient had a DMFT index corresponding to the average of the Chilean population (DMFT 14) (Urzua et al., 2012), according to a clinical examination conducted by an experienced professor. The bitewing radiographies were taken using a film holder, an x-ray machine (Sirona Heliodent Vario, Sirona Drive Suite 100 Charlotte, NC 28273, USA) at 65 kV, 7 mA and exposure time of 0.63 s for molars and 0.5 s for premolars.
Digital images of the radiography set were obtained scanning the films with Roviscan (Posdion Co., Korea) at 600dpi/14bits. The images are shown in Figure 1. The images were first assessed by a group of four experienced Dental and Maxillofacial Radiology specialists (with 1998 hours of specialist training and at least five years of experience as specialist). The images were displayed without any information such as name, gender, age or clinic history on a Laptop 15.5" 16: 9 (Sony Vaio SVS151A11U) with a resolution of 1920x1080 pixels (image size 16.27 x 11.44 cm). The laptop was placed on a comfortable desktop in a dark environment. Through consensus, they determined the presence or absence of proximal caries in the premolar and molar proximal surfaces, using the following scale: 0, sound; 1: enamel caries; 2: dentinal caries. Due to the nature of this in vivo study, this assessment was defined as the reference standard (Ferreira et al., 2006).

Fig. 1 Bitewing radiograph set selected for evaluation. (a) Right and (b) left premolars and molars, the teeth evaluated are numbered.
Experimental Groups. Three different groups of examiners assessed the bitewing radiographs, arranged as follow:
Group A. Third-year dental students (n=60), without clinical experience.
Group B. Fourth-year dental students (n=60), with 884 h of clinical experience.
Group C. Dentists (general practitioners) recently graduated (n=60), with at least 2250 h of clinical experience.
The examiners did not receive an additional training or calibration in interpretation of bitewing radiographs. Each examiner performed radiographic examination after instructed through oral orientation to consider proximal surfaces of premolars and molars, excepting third molars (32 proximal surfaces), and classify into 0: sound; 1: proximal radiolucency in enamel (enamel caries) and 2: proximal radiolucency in dentine (dentinal caries). To facilitate the radiographic examination and avoid errors outside the examination type, each tooth was numbered, and there was no time limit. First, images of the right region (molars and premolars) and then the left region were shown (Fig. 1). The images were displayed following the same protocol used with the group of radiology specialists.
Data analysis. Using the consensus of the four Dental and Maxillofacial Radiology specialists as a reference standard, the receiver operating characteristic (ROC) curve was calculated for each observer, in which the area under the curve was interpreted as accuracy. The Shapiro-Wilk test was used to assess the distribution of the sample and pairwise comparison of ROC curves was performed at enamel and dentin thresholds, by Mann-Whitney test using IBM SPSS Statistics v.21.0 (SPSS Inc. ChicagoIL-USA).
The inter observer reliability was calculated using the Fleiss’ Kappa value for each experimental group. Kappa values above 0.75 denoted excellent agreement, while values between 0.40 and 0.75 indicated good agreement, and less than 0.40 denote marginal agreement (Fleiss, 1981).
RESULTS
In this study, from the 32 proximal surfaces analysed, the consensus of the radiologist specialists revealed that nine (28.1 %) were caries free, 19 (59.3 %) had caries lesions restricted to enamel, and 4 (12.5 %) had dentinal caries lesions. Thus, a total of 23 (71.8 %) surfaces were considered to be positive for caries lesions.
The demographic characteristics of the examiners groups are shown in Table I. The accuracy values, according to the type of caries lesions for all groups are summarized in Table II. The best value of accuracy was presented by the group of dentists (0.58 and 0.63 for enamel and dentinal caries respectively).
Table I Descriptive characteristics of examiners groups.
Examiners groups | n (%) | Age (years) | ||||
male | female | total | male | female | ||
A | 28 (46.6%) | 32 (53.3%) | 60 | 21.2 | 20.9 | |
B | 31 (51.6%) | 29 (48.3%) | 60 | 22.4 | 21.8 | |
C | 27 (45%) | 33 (55%) | 60 | 24.5 | 24.2 |
Table II Average accuracy of radiographic examination for caries detection of the different examiners groups.
Examiners groups | Accuracy | ||
Enamel | Dentin | ||
A | 0,54 (0,067)a | 0,55 (0,08)a | |
B | 0,55 (0,075)ab | 0,61 (0,11)b | |
C | 0,58 (0,08)b | 0,63 (0,14)b |
Within columns, different letters indicate values with statistically significant difference (p<0.05).
Inter examiner reliability shown as Fleiss´s kappa values are presented in Table III. For enamel caries lesions, groups A and C presented the highest kappa, statistically significant higher than group B. However, for dentinal caries lesions group C presented the highest kappa, followed by group B and A. Overall, for the three groups, the inter examiner reliability denoted marginal agreement (<0.40).
Table III Fleiss´ Kappa values for interexaminer reliability of the different groups.
Examiners groups | Reliability | |||||
Enamel | Dentin | |||||
Kappa | CI | SE | Kappa | CI | SE | |
A | 0.249a | 0.227 - 0.272 | 0.012 | 0.058a | 0.037 - 0.076 | 0.012 |
B | 0.213b | 0.196 - 0.230 | 0.009 | 0.102b | 0.086 - 0.119 | 0.008 |
C | 0.282a | 0.248 - 0.316 | 0.017 | 0.344c | 0.310 - 0.377 | 0.017 |
Within columns, different letters indicate values with statistically significant difference (p<0.05).
DISCUSSION
There are few studies concerning the process of learning and developing diagnostic skills in radiographic interpretation (Wrbas et al., 2000; Mileman et al., 2004; Shams et al.). In this study, we aimed to evaluate the differences presented between dental students and graduated dentist with greater clinical experience, when detecting proximal caries lesions in bitewing radiographs. The results obtained showed better accuracy in graduated dentists, however, both professionals and dental student showed poor agreement in their radiographic diagnoses.
The third-year dental students included in this study have received theoretical knowledge through a one-year Dental Radiology course. In contrast, fourthyear dental students have an additional 884 hours of clinical practice and recently graduated dentists have completed a total of at least 2250 clinical hours. Therefore, the presented results seem to indicate that clinical experience plays a significant role in improving accuracy when diagnosing proximal caries lesions. These results, similarly to findings of Hellén-Halme and Petersson (Hellen-Halme & Petersson, 2010) indicate that theoretical knowledge is not in itself enough, and practice and experience are essential for detecting the presence of a carious lesion. The daily practice of removing caries lesions, while correlating it to their radiographic image, probably allows a better understanding of caries radiographic progression (Hellen-Halme & Petersson). In addition, it is suggested that dental students might have a greater tendency for false positives, since they have recently learned that every disease must be detected and properly treated, and for them not detecting a caries lesion could be considered a big error (Diniz et al.).
Other studies have evaluated whether education level and experience can affect diagnostic accuracy of caries lesions in radiographs (Lazarchik et al., 1995; Mileman & van den Hout, 2002; Hellen-Halme & Petersson). Hellen-Halme & Petersson reported that experienced dentists diagnosed radiographic proximal caries more accurately than dental hygiene students and dentistry students. Likewise, Mileman & van den Hout found better specificity in dentists than in dentistry students. However, they also showed worse sensitivity. The better sensitivity of the students in that work was explained as a consequence of a more liberal cut-off point in the student group (Mileman & van den Hout). Diniz et al. reported similar values of accuracy for dental practitioners and students for recognizing dentine caries from bitewing radiographs.
In the present study accuracy and interexaminer reliability were considered low, denoting marginal agreement (Fleiss). Similar results have been reported by Diniz et al. even with dentists with 5-7 years of experience. Comparable to this study, in the above mentioned study the majority of lesions were restricted to the enamel tissue, and this fact was stated to explained the low agreement values obtained (Diniz et al.). In contrast, in the studies of Lussi et al. (1995) and Ricketts et al. (1994), higher proportion of surfaces had deep dentinal caries and high sensitivity values were obtained. Therefore, a better performance could be expected if deeper caries lesions were included.
To accomplish an effective training in radiography interpretation, there should be numerous opportunities of corrective feedback (Wrbas et al.). During the education process of dental students, possibilities to compare clinical and radiographic image of caries lesions could be offered by the aid of computer-aided learning (Mileman et al., 2003, 2004; Wenzel, 1995). Unfortunately, other alternatives to improve skills of additional didactic instruction, based on lectures and discussion of case reports has not proven to be very effective (Wrbas et al.). Therefore, in order to raise the educational level in dental radiographic diagnosis, activities that aim to replicate the learning achieved by clinical experience, comparing clinical and radiographic outlining of lesions, could be suggested (Wrbas et al.).
The set of bitewing radiography chosen was not selected from a radiography bank, but from a real patient, taken under normal conditions. Other published studies assessing this issue have preferred the use of simulated bitewing radiographs taken from extracted teeth (Mileman & van den Hout; Hellen-Halme & Petersson). This has the potential benefit that later, the extracted teeth can be assessed with histological methods and these results used as a reference standard. The condition of not using bitewing radiographs from a bank can also be acknowledged as a positive innovation of this study. In their daily practice, clinicians are confronted with radiographs taken from real-life patients, which present differences in alignment of the teeth, bone structure, that are difficult to mimic with extracted teeth mounted on a cast.
Some possible limitation of the present study is the use of digitized radiographs. These images could add difficulties to the detection of lesions, due to reported increased physical noise of the radiographic image. (Wenzel et al., 1993; Møystad et al., 1996; Hintze et al., 2002) However, Prapayasatok et al. (2006) and Kimmes et al. (2006) did not report differences between digitized and conventional radiographs for caries detection, and it was stated that these images could be used for diagnosis.
In conclusion, based on the methodology used in this study, the accuracy and interexaminer reliability of bitewing radiographic examination were influenced by the clinical experience when detecting proximal caries. Low values of accuracy and interexaminer reliability were noted, reinforcing the necessity of improving education and training in radiographic caries detection. This study demonstrates the need for training and continuous learning for undergraduate students and dentists in order to improve performance in detecting radiographic caries lesions.