Clinical Chemistry and Laboratory Medicine (CCLM) is a journal reporting the majority of the important papers on reference values. The actual editor-in-chief, in his recent editorial  is in good agreement with the previous editor-in-chief who created a specific subheading on reference values in the journal.
We would like to present the genesis and the evolution of the concept through the activity of different groups at the international level starting with our own contribution, particularly in the Center of Preventive Medicine (CMP). We have asked friends involved with us in the development of the concept of reference values for comments and historical milestones.
First movement: allegro con fuoco (1965–1980)
This is the time for a triumphant automation. Laboratories became able to produce a large amount of data with a high level of quality. A series of articles consider the issues of biological and analytical variations in long-term studies in order to assess their possible physiological and medical implications [2–6]. Another group of scientists studied the factors contributing to intra-individual variability [7–12] and the effects of preanalytical factors on the intra-individual variation of some serum constituents .
In 1968, when we were asked to create and direct the laboratory of the CMP, we were in a good position to understand the importance of biological variability and genetic influences among healthy families. The medical director of the CMP was himself influenced by the work of Roger J. Williams  with his book on biochemical individuality.
In this context, it became evident that for the studies of markers of health deviation, it will be important to master and to better understand biological variability of all plasma constituents in healthy people, in healthy families and in chronic diseases. It was also time for a triumphant automation. Laboratories became able to produce a large amount of data with a high level of quality. The first articles on biological variation are published. Our co-workers started a series of PhD studies on nutritional influences, exercises, drug effects and also creating a biobanking organization for retrospective studies. Some other PhDs focused on biological variability of group constituents: enzymes, proteins and some prospective studies on the use of white blood cells. During more than 40 years, the group has produced 10 books and around 300 publications .
At the final round table of the first Pont-à-Mousson Biologie Prospective Conference in 1970 , M.C. Sanz and the discussants, particularly Pierre Metais, emphasized the importance for preventive medicine of a panel of laboratory tests carried out on a large number of subjects with automatic and standardized methods. The discussion went immediately to normal values and individual variability.
At the same time, IFCC-LM created a new expert panel on the ‘Theory of reference values’ under the leadership of Tom Whitehead and soon after, of Ralph Gräsbeck.
The beginning of the reference values
The Minerva Foundation Institute for Medical Research, Biomedicum U2, 00290 Helsinki, Finland
My school participated in an exchange of teachers between Sweden and Finland. We had an excellent Swede who taught biology, including genetics. I especially remember that he taught that biometric data, e.g., the size of the leaves of a tree, were distributed in the Gaussian manner. Later, as a medical doctor, I absorbed the prevailing belief that laboratory results from normal people were distributed in this way and that the usual normal values were calculated by taking the mean ±2 SD. Then, starting my scientific career, I participated in a population study involving 1345 persons whose serum vitamin B12 concentrations were assayed . Our team used an excellent statistician who presented our data as histograms and curves fitted to them. The ordinary Gauss curve did not fit, so he took the logarithm of the observed values, and hocus-pocus, the curves fitted excellently. I asked the statistician why, but only got the response that this was the customary thing to do. I was not satisfied and began to dig into the literature and found quite interesting articles (quoted in ) casting doubt on the usual Gaussian distribution.
In the late 1960s I became editor of Scandinavian Journal of Clinical and Laboratory Medicine. The editors were supposed to write editorials, so I wrote one together with a statistician Fellman . It criticized the use of the term normal and calculation of normal ranges assuming Gaussian distribution. At about the same time, I was chosen President of the next Scandinavian Congress of Clinical Chemistry and Clinical Physiology outside Helsinki in 1969. The organizers of the Congress had to devise programs for the sessions, and being sensitized to the topic I suggested ‘normal values‘ as a theme which was accepted.
In planning the session and my own presentation, I realized the analogous character of the normal values and the controls in the experimental sciences. I also thought that a less ambiguous term than normal values ought to be used (actually  already mentions reference population). My schoolmate Nils-Erik Saris was also one of the organizers of the Congress and experienced in producing IFCC recommendations, especially on quantities and units. I contacted him, and we decided to spend an October weekend in my island in the Baltic planning the session and especially clarifying the hazy concept of normal values and trying to produce a paper with suggested solutions of the problems. (A humorous incident occurred during our two-man sauna symposium in the naked state and has been reported) . During the winter season we produced a short paper  suggesting the introduction of the term reference values, outlining a number of branches of the field (nomenclature, reference populations, prenalytical factors, etc.) and calling for international collaboration.
Our suggestions clearly interested the audience, which at the end of the Congress formed the General Assembly of the Scandinavian Society sponsoring the Congress. The Assembly decided to activate the rather dormant function of the Society by nominating committees to produce recommendations in several fields of current interest: quantities and units, enzyme assays and normal values. I became chairman of the last-mentioned committee, which soon began to function and changed its name to the Committee on Reference Values. The other members of the Committee were R. Dybkaer from Denmark, S. Skandsen from Norway and M. Hjelm from Sweden. The early history of the Scandinavian Committee is found in .
At about the same time an international congress took place in Geneva and Evian. Afterwords I learned that I had become member of an analogous international Committee on Reference Values of IFCC. When it met, T.P. Whitehead was the chairman and the rest of us members of ‘The expert panel on the theory of reference values’. Its other members, except myself, were G. Siest (F), G. Z. Williams (USA) and P. Wilding (UK); Siest and Wilding entered the Panel somewhat later than the others.
The work of the Panel was slow, nothing was published, which evoked criticism. The cause was mainly disagreement on many points, e.g., the exact meanings of the terms reference value and health, but also on practical details such as specimen collection, some favoring strict rules, while others are lax. To provide some guidelines, a fairly long series of educational articles on reference values appeared in the IFCC Newsletter No. 16, February 1977 . A constructive international symposium on Reference Values in Laboratory Medicine was held outside Helsinki in May 1980 . IFCC at that time had rules preventing a member of a panel to serve for more than a fixed number of years. Whitehead therefore resigned in 1976, and I was appointed chairman of the Panel. The panel now produced its first recommendation, albeit only a preliminary one ‘Provisional recommendation on the theory of reference values’ dated 1978 . Then, I was in turn to resign, but continued to be an unofficial adviser and a link to the Scandinavian Committee, which decided to stick to the practical rather than the theoretical aspects of reference values.
In the work of the Scandinavian Committee, Solberg proved himself very competent, so I succeeded in getting him appointed as my successor. Under his guidance the Expert Panel completed an impressive number of recommendations. He and the members of the Panel (including myself) also spread the gospel of reference values in personal articles, one of the most noted ones being a humorous ‘Miss Manhattan’ study correcting the view that the requirements for collecting reference values were almost impossible to satisfy .
At the second Pont-à-Mousson meeting in 1972, we described the first results on our family population and we published the first book on reference values . We were in contact with all the scientists involved in the field and in IFCC but also with Tom Whitehead and Peter Wilding who participated in the Pont-à-Mousson meeting. They were working in a comparable health screening organization BUPA in the UK. Production of reference values required large healthy populations, military recruits, blood donors, medical students, and laboratory workers etc… populations are poor representatives of healthy populations. The very comprehensive studies have been published by health screening centers (Table 1).
The recruitment of healthy subjects is crucial for defining better laboratory test biological variability.
We were in a good position with Peter Wilding to develop the first diagrams on the definition which were at the origin of the IFCC recommendations.
Memories of IFCC Expert Panel activities in 1974
Peter Wilding, PhD, F.R.C.Path., FACB
Professor Emeritus, University of Pennsylvania, Philadelphia, PA, USA
In 1974, I was invited by Prof. Tom Whitehead to join the IFCC Expert Panel on the Theory of Reference Values. The panel at that time included Prof. Ralph Gräsbeck, Prof. George Z. Williams, Prof. Gerard Siest and myself. It is only in recent years that I have come to truly appreciate the privilege I enjoyed being part of a group that would establish the principles and terminology that governs the topic of reference value use, and assignment, today and in future years. Moreover, I gained so much from the interaction with four outstanding clinical chemists who worked energetically to improve our profession.
For several years the Expert Panel had strived to define its purpose, and determine a strategy, for proposing a clear and definitive nomenclature for the topic. Much of the problem lay in finding a way to present the subject in a simple and unambiguous manner that would facilitate understanding of why we needed reference values, how to create them and how to interpret them.
In 1976 Gerard Siest and I were asked to find a structure for presentation. We met one evening at a London hotel and after a lengthy, and animated, discussion formulated a diagram that we believed would meet the Panel’s needs. The diagram was presented to the Panel, where it was accepted unanimously, and was presented to the profession in several journals in 1978. The diagram (see below) has stood the test of time and laid the basis of numerous derivative articles from 1978 to 1991 and Approved Recommendations from IFCC. None of the five individuals who formulated, and presented, this time-tested proposal are included in the authorship of the Approved Recommendations, but whenever I see, or read, articles using terms such as ‘reference individuals’, ‘reference values’ and ‘reference intervals’, my memories of a long evening in a London Hotel conference room arguing, discussing and debating with my friend Gerard Siest are revived with a tremendous sense of accomplishment over what we created.
The ‘original’ Scheme (Siest and Wilding):
From which is selected a
REFERENCE SAMPLE GROUP
On which are determined
On which is observed a
From which are calculated
That may define
This first document corresponds to the first IFCC paper published 13 years later .
Outside this expert panel, different groups of scientists were involved in studies on reference values between the years 1970 and 2000. We could classify them into four international groups:
1. SFBC (Société Française de Biologie Clinique) and French speaking group:
G. Siest, J. Henny, R. Guegen, Ch. Sachs,
L. Guize, B. Gouget, G. Blin, J.P. Bretaudière,
M. Drosdowsky, H. Favre, E. Daubrosse-Panek,
A. Aellig, J.M. Queralto, F. Ramon, A. Albert,
C. Heusghem, C. Petitclerc, L. Munan, R. Zender…
2. Scandinavian group:
R. Gräsbeck, R. Dybkaer, S. Skandsen, M. Hjelm,
H. Solberg, P. Hyltoft-Petersen, N. Lahti,
M. Jungner, A. Kallner
3. American and British group:
G.Z. Williams, E.K. Harris, D. Young, J. Boyd,
P. Wilding, J. Lott, C. Fraser
4. Spanish group:
J.M. Queralto, F. Antoja, M. Cortes, M.V. Domenech,
J. Fuentes, M.J. Llagostera, J. Ordonez, F. Ramon,
I. Rojo, J. Fuentes-Arderiu, J. Horno, P. Ferrer
Due to the activity of the same people in IFCC and the French speaking group, the recommendations have been prepared simultaneously by both groups perhaps with more details in the SFBC one where the meetings were more frequent. Some countries, e.g., Spain also had a reference values committee with good input from our Spanish colleagues.
We can compare the papers published by the four main groups, they are not equivalent. Often papers in French were published before and used as reference documents later for IFCC recommendations that came much too long after. IFCC Expert Panel on theory of reference values was created in 1972 and the last approved recommendations came 19 years later! (Table 2).
The SFBC Reference Values Working group was created in 1973 and recommendations were published in the early 1980s; a last recommendation on the use of reference values  was not addressed by the IFCC expert panel. The Spanish group was also very active, publishing six articles of recommendations. Two of them were additional documents to those of the IFCC and SFBC: one on the transferability of reference values, the second one on the intra-individual reference values.
Simultaneously, the Scandinavian group, published in 1993, on behalf of the Committee on Reference Values of the Scandinavian Society for Clinical Chemistry the recommendation for the preanalytical step including management of the biological samples .
The works of the different committees were presented in IFCC conferences or other specific meetings. They were occasions to meet our American and Canadian colleagues. That was the case in 1975 in Toronto, where we presented the concept of reference state and met Claude Petitclerc and Louis Munan. They have developed an ‘atlas of blood data’ and it was the start of a fruitful collaboration.
Claude Petitclerc, MD, PhD, F.R.C.Path.
Department of Biochemistry, CHUM-Hôpital-Notre-Dame, Montreal, QC, Canada
In the 1970s, I was involved in a long and fruitful collaboration with epidemiologists Louis Munan and Anthea Kelly at Sherbrooke University. The Sherbrooke data obtained from a probability sample of families of a large freestanding population was the subject of many publications on reference data and correlates [56–59]. In 1978 I took sabbatical leave to join Gerard Siest’s team.
CMP was the mecca. Not only was the staff remarkable in number and quality but the study population was much larger, family-based and very well-documented. Reference values were systematically produced and published.
Similarities of both sampled populations and laboratory technologies prompted a comparative study that revealed not only population differences but also analytical biases . The problem of transferability of laboratory results and hence reference values had to be addressed. Reference data produced with the technology of the 1970s could not be used decades later. However, the enormous contribution of Siest et al. at CMP was the systematic study of factors of biological variations and their stratification for each substance. This information, independent of technology, was and remains transferable.
More complex was the issue of analytical variability imbedded in the massive amount of population data over the years, especially at the time of data mining. It was then a concern and is still one. A challenge for study of long-term longitudinal variations. A long and fruitful collaboration of 35 years was the outcome of this sabbatical leave.
During my stay, I had the honor to serve on Commission des valeurs de référence de la Société Française de Biologie Clinique (SFBC). Inspired by Gerard Siest‘s leadership SFBC was very proactive in publishing guidelines for the production of reference values; being part of such a dynamic group opened the way to relay Gerard Siest on the Expert Panel on Theory of Reference Values of IFCC. I keep a vivid memory of the members of the panel with whom I enjoyed to exchange with for some 10 years under chairman Erik Solberg. Document 2 and 3 were spinoffs of both SFBC and Scandinavian Society for Clinical Chemistry recommendations.
CMP is no longer what it once was, but the people who made it what it was are still around, with us or in our memory. I cherish them all.
Second movement: adagio (1980–2000)
Between 1980 and 1990, it was a period of maturation but no efforts were made for developing the practical use of reference values by clinicians. We organized in 1983 a meeting in Nancy with Gene Harris to try to develop a new strategy based on his statistical competences and in collaboration with the other statisticians in our group or in Europe.
The importance of Eugene K. Harris contribution
Josep M. Queraltó
Hospital de la Santa Creu i Sant Pau, Servei de Bioquimica, Av. Sant Antoni M. Claret, 167, 08025 Barcelona, Spain
I was very happy to work with Eugene K. Harris during a sabbatical period. I am indebted to him sharing his great scientific culture with me. Although reference values theory clearly is a part of the so-called post-analytical phase, the development of reference values theory is without a doubt a decided drive to refine preanalytical and analytical concepts. The central idea to create a space where a presumptive patient can be tested against non-diseased individuals raised the need for rigorous, practical tools to manage physiological pathological knowledge. Uncertainty, variability, sampling, confidence, tolerance, robustness, expectancy… are concepts with an accurate meaning in mathematical statistics which soon were part of the reference values theory. Accurate use of these terms deserved the special attention of all the groups which conceptualized reference values. Among the statisticians who played a key role to translate statistical science into laboratory medicine was Eugene K. Harris (1927–1997). Gene Harris was a wise man with a broad culture and scientific interest. Gene had an exceptional ability in grasping what those professionals which consult him really needed. At the end of the 1960s, Gene begun to collaborate with Prof. George Z. Williams, who had already published some articles about laboratory data process . Their insights on the components of medical laboratory data variation, biological and analytic, were published in a series of influent articles [2–6]. He realized the relevance of within-person variability in the assessment of biological variation of a number of analytes [62–65]. Eugene Harris contributed significantly in virtually all the aspects of the reference values theory where statistics are required: gaussianity requirements , analytical goals [67, 68], analytical variation in monitoring reference individual over time [69, 70], the relative influence of analytical to biological variance , the relevance of intra-individual component of biological variance [72, 73], the interpretation of the practical significance of a change in laboratory results [74–78], multivariate regions of reference  and the requirement of partitioning reference populations [80, 81], as well as comprehensive reviews  and books on statistical aspects of reference values production and use [83, 84]. Moreover, he was persuaded of the need to convey to both clinical chemists and doctors the clearest and sharper statistical concepts together with the straightforward and practical procedures and methods available. At that time, when personal computers did not enjoy today’s facilities, he provided also useful and simple code to help in statistical calculations . The vision and personality of Eugene Harris inspired the work from national and international committees and scientists [32, 52] and today still remains as a reference.
During this period, we started the recruitment of 1000 families of the Stanislas cohort  on a more specific research group in CMP focused on individual specific reference values in subject under medication, e.g., contraceptive pills. We tried also to propose new presentation of reference limits particularly for a better use in prevention on health maintenance including for and by the patients themselves.
The data produced were not limited to biochemical components but also to weight, blood pressure etc… The material collected in the Stanislas cohort project comprised a lot of interesting longitudinal familial data and of biological variability and many publications came out from this tool  and particularly the effects of genetics , e.g., Apolipoprotein E.
We tried many times particularly as IFCC president to recreate an IFCC reference values committee but the IFCC scientific committee opposed propositions made by a group of scientists. The objectives of this committee would comprise the following:
Update IFCC recommendations;
Produce practical guidelines;
Increase the transferability;
Improve the proper and extended use of reference values by clinicians, laboratory scientists and patients.
We proposed, without success, a joint CCLM IFCC committee for a better and quicker diffusion of the work and the recommendations.
The founding texts of various scientific societies over the past 20 years are true reference knowledge. They had the merit to disseminate the concept of reference values. Thanks to them, the RIs have become a basic tool for the interpretation of the quantitative results of laboratory tests. However, the evolution of medical practice led to question the current relevance of this concept.
Third movement: Allegro ma non-troppo (2000–2012)
It is obvious that in the late 1990s the concept of reference values is progressively used by all health professionals, including clinical chemists, clinicians, and simultaneously by all official bodies in charge of the establishment of the legislation. Its use by the In Vitro Diagnostic (IVD) industry and laboratorians is now recommended by the European Directive 98/79 EC  and the ISO 15189 standard [88, 89].
In contrast, the application of the concept remains difficult in clinical and laboratory practice: procedures for estimating reference limits need to be improved. They are too long and too expensive and are not feasible for all laboratories. In 2000, a group of scientists and professionals focused on areas for improvement by proposing that societies and organizations implement the current recommendations .
Fundamental works: evolution of the concept and proposals for new procedures
In the early 2000s The Catalan Association of Clinical Laboratory Sciences organized two successive symposia for opening dialogue between professionals and industry representatives. It approached the legal and normative alternatives to the production of specific reference values and the transferability of reference values and the methodological aspects (how define a homogenous starting population, how to define acceptable health status for reference individuals, methodological and statistical aspects, diffusion and tracking of the information) .
A year later, in 2004, after the second Catalan Symposium, a special issue of CCLM was published in order to share the basic concepts and new ideas on the field reference values . The expressed opinion were very diverse and do not reflect systematically the official practices. It was the choice of editors to open debates and initiate discussions. Among the 25 contributions it is impossible to say which are the most determining and most helpful. At the risk of being wrong, here are the themes that seem, 10 years later, the most significant: is there a concept of health and normality?, statistical description of reference values, interpretation of reference limits, the concept of common and multicenter reference intervals, and analytical aspects in relation to reference intervals.
Per Hyltoft Petersen
NOKLUS, Norwegian Quality Improvement of Primary Care Laboratories, Division for General Practice, University of Bergen, Bergen, Norway
I have been involved in application of the concept in creation of common reference intervals for plasma proteins in the Nordic countries in the 80th and 90th, mostly published as NORDKEM reports and in Upsala Journal of Medical Sciences [93, 94]. NORIP, the other Nordic project on common reference intervals, was chaired by Pål Rustad in Oslo, Norway.
The development of the concept on reference values was a great intellectual conquest which solved many problems in relation to interpretation of clinical laboratory data. The concept, however, is developed for single laboratories, so there may still be differences in measurement results and in the interpretation for the same samples analyzed in different laboratories. Consequently, the next aim was to harmonize measurement results and interpretation of data among laboratories, so that results were interchangeable and clinical explanation would be independent of which laboratory performed the analysis.
Analytical quality: The question on analytical quality of measurements is technical analytical and relates to many elements in the traceability chain from reference method values or international certified reference preparations to the single measurement result. Even these efforts are essential; the final analytical quality can only be proved by control with genuine human materials with target values measured by the reference method. The documentation procedure is considerable and rules for acceptable analytical quality specifications are outlined.
With the analytical quality assured and documented, the next question is whether the reference intervals are dependent on gender, genetic and other biological characteristics.
Models for treatment of the biological information from reference values
Even the idea of sampling of reference individuals for estimation of reference intervals is clear in the sense of defining healthy individuals through rule-in and rule-out criteria, the concept of health is more complicated than supposed for varying criteria, and a common question is whether the reference interval should mirror the current apparently healthy population (without any known diagnosis) or be an ideal for all, e.g., based on teetotallers.
Further, differences in rule-in and rule-out criteria may be needed for varying components. A reference interval for, e.g., serum-thyroid stimulating hormone (TSH) will need criteria for absence of thyroid antibodies in all reference individuals, and when, e.g., reference values are to be collected for plasma-glucose and HbA1c in relation to diabetes diagnostic, the criteria will be very strict regarding, e.g., first-generation diabetes relatives. Moreover, the biological concept has been overruled by the international clinical guidelines which recommend diagnostic decision limits for the diabetes diagnosis, neglecting biological variation and other individualities.
When two or more subgroups of reference individuals are disclosing deviating reference intervals, it is necessary to decide whether the groups can be combined or should be separated. Varying partitioning criteria have been proposed, but the dominating model is to separate if the result of using the common reference interval results in percentages of reference individuals for lower as well as upper reference limits for both groups are beyond 0.9%–4.1%.
The idea of common reference intervals is extremely attractive and several attempts have been made. For serum electrolytes and some metabolites it seems possible to establish common reference intervals within ethnic groups and selected geographical areas, but for most other components it is only possible within smaller well-defined groups, and it is necessary to test whether common reference intervals can be established.
Further, I have been involved in evaluation of analytical quality specifications, together with Elizabeth Gowans, Callum Fraser, and others, and in proposing criteria for partitioning together with Ari Lahti.
Together with Callum Fraser, Anders Kallner and Desmond Kenny, I arranged the Stockholm conference on analytical quality specifications, published in Scand J Clin Lab Invest 1999;59:475–585, and together with Joseph Henny I edited the CCLM issue on reference values . Moreover, I wrote a paper together with Esther Jensen and Ivan Brandslund: Analytical performance, reference values and decision limits. There is a need to differentiate between reference intervals and decision limits and to define analytical quality specifications which shows many of the restrictions of use of reference intervals and of decision limits .
In 2012, a 2nd Special Issue of CCLM devoted to reference values was published. It highlights, once again, the interest in this topic by successive editors of this journal. It brings together a variety of more than 40 articles most of which are devoted to the determination of RIs for particular analytes .
Following and/or in parallel with the publication of CCLM’s Specials Issues, numerous articles were published mainly in CCLM and in some other scientific journals (mainly in Clinical Biochemistry and in the Scandinavian Journal of Clinical Laboratory Investigation). Several themes are of particular interest:
The concept of common and/or multicenter reference intervals has been developed in order to avoid laboratories having to produce their own reference values. Two different approaches have been proposed:
The first was established in the 1980s by the Nordkem Project [93, 94] followed up some years after on a larger scale by the Nordic Reference Interval Project 2000 (NORIP) to determine common biological reference intervals in wide geographical areas for biochemical quantities and for large populations reference [97–100]. It is also clearly stated that several conditions must be fulfilled: 1) reference population from the different countries must have the same genetic profile and have a similar lifestyle; and 2) certain quality requirements must be met (including traceability of measurement systems, each laboratory using routine analytical systems). This project involved the participation of 102 laboratories in five different countries (Denmark, Norway, Sweden, Finland and Iceland).
The second approach was proposed by the Catalan Association for Clinical Laboratory Science . The concept is a little different: several clinical laboratories in the same geographical area, using the same analytical system determine reference intervals in the context of a collaborative project with the IVD industry. Recruitment of individual’s reference is under the responsibility of each clinical laboratory. Each step preanalytical, analytical, statistical treatment follows a well-defined protocol. Metrological traceability is ensured. Several examples of application to specific analytical systems were published in CCLM [105–110].
The approach proposed by the Catalan Association provides only partial response: the RIs proposed are valid only for the considered analytical system. The Scandinavian approach is appealing: the calculated RIs apply to routine methods of each clinical laboratory; potential biases of each routine method are eliminated by the use of common reference materials measured in each participating laboratory. However, this is a very demanding procedure applicable to laboratories having an outstanding experience of standardization as shown by Hyltoft Petersen and Rustad . However, it is necessary to observe a number of rules specified by Ceriotti  and Boyd .
Improvement of statistical procedures has mobilized the energies, none of those proposed in the multiple recommendations are, to this day, perfect.
Data mining: Although the use of data mining procedures are extremely controversial, due to the lack of information on reference individuals which may lead to selection bias, the Haeckel’s staff, in Germany defends the use of broad data sets from laboratory information systems to derive intra-laboratory reference limits. The RIs are derived by sophisticated statistical procedures, requiring the intervention of highly skilled biostatisticians [114, 115]. The authors note, however, that if the differences between laboratories RIs are observed, they are mainly due to biases between the different analytical procedures [116, 117]. However, Kallner showed that the central 95 percentile derived from Primary Health Care population was wider than those usually assigned, whereas the median almost coincided . He and his coworkers claimed that remaining non-healthy individuals enlarge artefactually the reference intervals.
A similar approach was developed by Grossi et al.  in the project REALAB. In a primary health care center, RIs are derived by retrospective analysis of a large set of data collected during three years after a careful selection of non-diseased individuals (based on clinical criteria) and with adapted statistical procedures.
Partitioning: the most common partition criteria are sex and age. If the reference set of data come from different reference populations (geographical regions, ethnical groups), the origin of each population can be considered as a partition criterion. There is currently no ‘gold standard’ for partitioning a reference sample. Several methods have been proposed by different authors and IFCC [120–122]. Lahti has done an outstanding critical comparison of the main methods proposed .
The concept of multivariate Reference Region was proposed as alternative to univariate reference intervals in the 1970s in order to reduce the number of ‘false-positive’. Even, if this concept has not been really currently used in clinical laboratories practice until now, it remains promising for the future .
The appropriate statistical procedures are a key to derive the RIs. It is difficult to make a choice among the proposed methods. Ichihara and Boyd published in 2010 a review of possible methods . Among them the authors point the multiple regression analysis as a powerful tool to identify the most important factors of biological variation and the analysis of variance (ANOVA) as an effective way to assess the need for partitioning reference values.
Robust approach: The IFCC recommends that a minimum number of subjects (n=120) is recruited to derive RIs. In some cases it is not possible to reach the suggested number. Horn and Pesce [126, 127] have proposed an alternative method to estimate reference intervals for data sets with a small number of observed values. Robust method to calculate the upper reference limit gives results comparable to traditional methods, but with a confidence interval higher.
Outliers and non-healthy effects individuals: Faced with the impossibility of recruiting a perfectly homogeneous reference population, statistical techniques are used to mitigate the effects of ‘abnormal’ individuals (in the sample of reference) on the width of the reference range. Horn believes this increase is approximately 10%. This reinforces the importance of proper selection, a priori, of the reference individuals. Usually the Tukey method is most commonly used. Horn proposed a more sophisticated method based in a first stage on a Box-Cox transformation, then in a second stage of labeling extreme values by the method of Tukey .
The analytical bias affects the measurement results and can change RIs, then this may can change the interpretation of results. Determining RIs with analytical procedures traceable to a reference system allows transferring results from one laboratory to another . The use of this strategy should be promoted and favored by IDV manufacturers to provide more effective information to clinicians. However, it has no universal value for many years; it concerns only the analytes traceable to a reference system (about 60 to this day). For others the issue of transferability remains. C-RIDL Committee is working hard on this issue and should propose solutions soon. Alternatives could also be considered, such as the use of a common calibrator, but also with other limitations .
The role of the IFCC-LM: creation of C-RIDL (Committee for Reference Intervals and Decision Limits)
Until 2004 the Scientific Division of the IFCC-LM was no longer interested in the reference values issues. It must be recognized that the guidelines proposed by the IFCC and the NCCLS (now CLSI) were less and less adapted to the needs of the professionals: IVD’s manufacturers and laboratorians. Directive 98/79/EC of the European Community requires manufacturers to provide reference limits in the package inserts of laboratory reagents kits . Other international organizations, such as ISO also recommends the use of RIs in the laboratory reports . To meet the growing demands of international regulations and, largely on the insistence of IVD manufacturers, IFCC-LM decided in 2005 to create the Committee on Reference Intervals and Limits decision (C-RIDL). It was composed of Ceriotti F. (Chairman), Boyd J., Henny J., Kairisto V., Klein G., and Queralto J.M. Almost simultaneously, a working group of the CLSI was created with similar objectives. A joint Working-Group IFCC-LM and CLSI was created in 2005 during the XIX International Congress of the IFCC-LM in Orlando. The purpose of this joint Working-Group was to review the document CLSI (C 28–A) published in its original version in 1995  and revised in 2000  in order:
To provide clinical laboratories and diagnostic test manufacturers with updated guidelines for determining reference intervals for quantitative analytes;
To provide recommendations regarding procedures that can be used to verify reliable reference intervals for use in laboratory medicine; and
To publish a document common to IFCC and CLSI.
Updated document was published in 2008 . The major guidelines of the original recommendations were preserved in the updated document. Three main innovations were introduced: the introduction of a robust method for analyzing references values (especially if the number of data is <100), the transference of RIs and RIs verification, and the reference intervals multicenter studies [134, 135].
Independently, in parallel, C-RIDL published from 2005 to 2010 a series of works. The first seeks to identify universal reference intervals for creatinine. A careful study of the literature showed that very few articles had taken into account the requirements of metrological traceability, thus few data from the literature are transferable from one laboratory to another. The C-RIDL established that for laboratory using a method traceable to IDMS the proposed RIs can be used .
The second job of the C-RIDL was devoted to the opportunity to determine Common Reference Intervals. In a preliminary article Ceriotti recalled the position of the IFCC-LM and prerequisites to follow . These are related to the design of the study and the analytical conditions (traceability). Ceriotti also recalls that the populations (that of laboratory and that of the study) must be comparable. An application’s work was made for the measurement of three enzymes (AST, ALT and GGT) measured with commercial analytical systems, but according to the standard methods recommended by the IFCC. Patient’s sera from four regions (Italy, China, Turkey and the Nordic countries) were analyzed. For AST and ALT the use of common RIs appears possible. In contrast, GGT shows significant differences between populations, worldwide RIs do not seem applicable .
Finally, two members of the Committee attempted to clarify the position of Reference Intervals vs. Decision Limits. In an article in the eJICC, Ceriotti and Henny recalled that the reference limits should not be confused with the decision limits. They recalled their definitions and characteristics (Conditions Influencing them, Information gathered, Statistical methodologies …). This is one of the few articles on the role of reference intervals in the process of the interpretation of laboratory results . Incidentally, this topic was raised again a few years later by Haeckel  and Hyltoft Petersen .
In 2011, Ichihara K. became the new chairman of the C-RIDL renewed (members: D. Armbruster, Barth J., Klee G., Ozarda Y., Pekelharing M.). Now, it aims to derive reliable country specific RIs through multicenter studies at a global level. A protocol was developed by the Committee. It will be applied to markers traceable to a reference system and non-standardized biochemical markers. The ongoing project will demonstrate the feasibility of the protocol and deriving RIs applicable to a global scale if there are no regional and/or ethnic groups’ differences. In 2012 the study is in progress and the results should be published in 2013.
Deriving works of reference limits
Besides these fundamental works extensively devoted to the derivation of reference intervals, numerous articles have been published in various scientific journals:
The influence of ethnicity as a partition factor has never been really studied systematically. Horn  and Johnson  observed for some analytes differences between populations that require separate reference intervals. In this way a large number of works from emerging countries have been published most often in scientific regional journals (they are too numerous to be mentioned here). Most of them follow (more or less) the traditional protocol of the IFCC, applied according to local opportunities and constraints. Each article covers a limited number of analytes and is applied to a group of well-defined populations.
Another group of works relates more specifically to certain periods of life: pediatric age, elderly and pregnancy.
Pediatrics reference intervals determination is one of the most difficult tasks to carry out, primarily because of ethical limitations related to blood drawing in very young children (or in neonates). Apart from a few specific derivation RIs studies for highly specialized analytes [143–146], the Canadian multicenter study CALIPER, aimed to establish a data base of age- and gender-specific pediatric intervals for 40 serum biochemical markers [147, 148]. As emphasized, Ceriotti  the authors have followed the recommendations of the IFCC and CLSI including selection of young children in good health (on the basis of clinical data). However, traceability of analytical data is not truly established. Also transferability of produced RIs may not be always transferable to other analytical systems. The Canadian group, as a service to major clinical laboratories, published a series of articles showing RIs related to various specific analytical systems [150–153].
Elderly RIs determination is even more challenging than for young children. The main difficulty is to select healthy individuals: most seniors do not meet the criteria of the IFCC/CLSI. Taking medication(s), unrecognized subclinical diseases can alter the width of the reference range. Hence it is very difficult to differentiate the effect of age, aging or a pathological condition. During this period 2000–2012 some authors are interested in this issue [154–156].
Pregnancy is a factor of biological variation due to physiological changes during pregnancy and postpartum. Although changes during pregnancy are well-known some articles are still published [157–159].
Genetic is now a well-recognized determinant for many biochemical markers. Use of genetic information for partitioning reference intervals could reduce the misidentification of unusual test results caused by non-disease associated genetic variation, as pointed by Shirts . Siest and co-workers started to be more precise in using genetic information for subgroups stratification, e.g., for ApoE  and more recently for haptoglobin . However, the part of biological variability induced by genetic variants is often low and the knowledge of the genetic status of the reference individuals is often lacking. In these conditions the partitioning of RIs by genetic information remain for the moment (and for many long years) only a possible way for the future.
During these years, the theory of reference values was unexpectedly applied to the veterinary laboratory medicine! A French team from the Veterinary School of Toulouse published several theoretical and applied articles, noticeably on reference intervals derived for several animals [162–166].
Fourth movement: finale
What is the situation so far from the CCLM 2000 article which identified gaps and desires of professionals ?
On the methodological level:
The importance of careful selection of reference individuals was reaffirmed by the IFCC-LM and confirmed by many authors. Selection from the database without having clinical information to ensure that people are healthy is prohibited. However, there is still no clear guidelines for the issue of RIs for unstable periods of life (e.g., aging or elderly).
One possible source of reference samples may come from large biobanks that are around the world, especially for studies of large populations. These samples come from individuals generally in good health. They are associated with clinical data often sufficiently comprehensive to select healthy individuals. Unfortunately, the only caveat remains the legal and regulatory restrictions on the use of biological samples in countries other than the original one. We must mention here the first experiment dedicated to the determination of RIs launched by the Scandinavian group which initiates the NORIP Project .
The issue of ethnicity as a factor of variability is still not resolved. We know that there are differences between ethnic groups for some analytes. Information is scarce and far from exhaustive. This is a major obstacle for the transferability of RIs without precautions. How to address this issue in a world where the notion of ethnicity is more and more difficult to define because of population movements? Finally, are the observed variations clinically significant? Genetic profile will help to define better the subgroup.
Following the initial concept of multicenter reference intervals we can recruit a larger number of reference subjects by dividing the efforts between several laboratories, but analytical bias between different analytical systems remain.
The concept of common reference intervals similar to the previous, emphasizing metrological 3traceability, allows the transfer of RIs to all laboratories using methods traceable to the same reference system, but to them only.
Traceability to a reference system is limited to a small number of analytes, other methods of transference and verification of available RIs are proposed by the IFCC-LM and CLSI, of course, with some limitations.
New statistical methods have been introduced and validated by the IFCC:
The robust method allows deriving RIs on small data sets of values. The accuracy of the so determined limits is less good than with the number of reference samples recommended, but it is an indispensable tool for certain populations (e.g., neonates, elderly, rare analytes, …).
Older forgotten statistical methods: multiple regression analysis, ANOVA are very useful for estimating the factors of biological variation.
The statistical methods of partitioning are effective, but they are limited in the presence of several sub-classes.
On the clinical use of RIs:
The concept of RIs is often misunderstood by medical doctors. It has been clearly reaffirmed that RIs should not be confused with the decision limits. The RIs are descriptive of a specific population and are derived from a reference distribution (usually 95% interval). Instead, the decision limits are thresholds above or below which a specific medical decision is recommended.
What is the use of RIs in clinical practice? This is a (small) help to medical interpretation when there is no medical decision threshold established, when the methods are not standardized (this is the case for many methods in immunochemistry). It is therefore particularly important that RIs are calculated as accurately as possible, in other words, they reflect a ‘true’ healthy population: sensitivity and/or specificity of the medical diagnosis depend on.
It would be incomplete if we do not take into account that, for many analytes, the variation is a continuum. Continuous variation of a biomarker may have diagnostic significance more important than a result beyond the upper RL With this in mind the concept of Reference Change Value could be really useful: unfortunately it is not really used in everyday practice .
In terms of access to RIs:
It is important to recognize that comprehensive information is not really available: some textbooks were published a few years ago, but they are not always available. In some articles which were published in various scientific journals, information is sparse, incomplete for only some markers, for well-defined populations and do not always meet the methodological requirements needed. Information from the literature is not always transferable and is not easily accessible to the professional community. This is why the major source of information is that of the IVD’s manufacturers, reported in the package inserts. However, is this information still reliable? Are they sufficiently informative? Are they still applicable to the population of each laboratory? The quality of the information provided in the technical documentation is very heterogeneous. Some manufacturers give very accurate and comprehensive information on how were selected reference individuals, ethnicity, number of individuals, and the statistical methods used. However, this is not always the case…. We can only encourage IVDs manufacturers to follow recommendations proposed by the IFCC and CLSI…
Training of medical doctors and laboratorians:
The misuse of the concept of RIs was noted above. Major educational efforts should be made towards medical doctors so that they understand the purpose, scope and limitations.
Publication of educational materials would be helpful. In this respect, even very simplified documents produced by the AACC and published on the website Lab Tests online is a first step, quite remarkable, but … towards patients .
Improving the presentation of the results has not been a major focus of interest at all for many years. The inclusion of more information on the biological variation factors (and others) in the analysis reports should be possible by new information technologies.
Real progress has been made in recent years, particularly in terms of methodology. CCLM contribution was instrumental in disseminating the work of the IFCC and many independent teams. A non-exhaustive survey of the literature shows that CCLM published in recent years the largest number of articles on the concept of reference values and its applications. There are still many fields to improve and complete: CCLM publishers have the duty to hold one’s position to this topic!
The scope of reference values and/or reference limits will remain attractive for many researchers for many years to come. However, we believe that future works should not be directed almost exclusively to methodological issues (statistical, analytical, …), but focus more specifically on the real contribution of the reference limits to the interpretation of laboratory results and to better define their place in medical decision-making. It must not be forgotten that these are based primarily on evidence-based clinical decision limits. This development is absolutely necessary to allow optimal use of laboratory tests and avoid misdiagnosis. This is a very exciting issue for the young scientists which needs, however, a close cooperation between laboratorians and medical doctors. It is on this condition that laboratory medicine can contribute to the improvement of Public Health for the benefit of patients.
Conflict of interest statement
Authors’ conflict of interest disclosure: The authors stated that there are no conflicts of interest regarding the publication of this article.
Research funding: None declared.
Employment or leadership: None declared.
Honorarium: None declared.
Plebani M, Lippi G. Reference values and the journal: why the past is now present. Clin Chem Lab Med 2012;50:963–4. Google Scholar
Williams GZ, Young DS, Stein MR, Cotlove E. Biological and analytic components of variation in long-term studies of serum constituents in normal subjects. I. Objectives, subject selection, laboratory procedures, and estimation of analytic deviation. Clin Chem 1970;16:1016–21. Google Scholar
Harris EK, Kanofsky P, Shakarji G, Cotlove E. Biological and analytic components of variation in long-term studies of serum constituents in normal subjects. II. Estimating biological components of variation. Clin Chem 1970;16:1022–7.Google Scholar
Cotlove E, Harris EK, Williams GZ. Biological and analytic components of variation in long-term studies of serum constituents in normal subjects. 3. Physiological and medical implications. Clin Chem 1970;16:1028–32.Google Scholar
Young DS, Harris EK, Cotlove E. Biological and analytic components of variation in long-term studies of serum constituents in normal subjects. IV. Results of a study designed to eliminate long-term analytic deviations. Clin Chem 1971;17:403–10.Google Scholar
Harris EK, DeMets DL. Biological and analytic components of variation in long-term studies of serum constituents in normal subjects. V. Estimated biological variations in ionized calcium. Clin Chem 1971;17:983–7. Google Scholar
Statland BE, Winkel P, Bokelund H. Factors contributing to intra-individual variation of serum constituents. 1. Within-day variation of serum constituents in healthy subjects. Clin Chem 1973;19:1374–9. Google Scholar
Statland BE, Winkel P, Bokelund H. Factors contributing to intra-individual variation of serum constituents. 2. Effects of exercise and diet on variation of serum constituents in healthy subjects. Clin Chem 1973;19:1380–3.Google Scholar
Bokelund H, Winkel P, Statland BE. Factors contributing to intra-individual variation of serum constituents: 3. Use of randomized duplicate serum specimens to evaluate sources of analytical error. Clin Chem 1974;20:1513–9.Google Scholar
Statland BE, Bokelund H, Winkel P. Factors contributing to intra-individual variation of serum constituents: 4. Effects of posture and tourniquet application on variation of serum constituents in healthy subjects. Clin Chem 1974;20:1513–9.Google Scholar
Winkel P, Statland BE, Bokelund H. Factors contributing to intra-individual variation of serum constituents: 5. Short-term day-to-day and within-hour variation of serum constituents in healthy subjects. Clin Chem 1974;20:1520–7.Google Scholar
Statland BE, Winkel P. Effects of preanalytical factors on the intraindividual variation of analytes in the blood of healthy subjects: consideration of preparation of the subject and time of venipuncture. CRC Crit Rev Clin Lab Sci 1977;8:105–44. CrossrefGoogle Scholar
William RJ. Biochemical individuality. The basis for the Genetotrophic Concept. Austin, TX: University of Texas Press, 1956. Google Scholar
40 years reference values and reference limits List (see Supplementary Data, which accompanies the article at http://www.degruyter.com/view/j/cclm.2012.51.issue-1/issue-files/cclm.2013.50.issue-1.xml).
Siest G. Organisation automatisation des laboratoires biologie prospective. Paris: Expansion Scientifique Française, 1970: 393–426. Google Scholar
Gräsbeck R, Nyberg W, Saarni M, von Bonsdorff B. Lognormal distribution of serum vitamin B12 levels and dependence of blood values on the B12 level in a large population heavily infected with Diphyllobothrium latum. J Lab Clin Med 1962;59:419–29. Google Scholar
IFCC News. 1980:26. Google Scholar
Gräsbeck R, Saris N-E. Establishment and use of normal values. Scand J Clin Lab Invest 1969;(Suppl 110):62–3. Google Scholar
Alström T, Gräsbeck R, Hjelm M, Skandsen S. Recommendations concerning the collection of reference values in clinical chemistry and activity report. Scand J Clin Lab Invest 1975;15(Suppl 144):1–174. Google Scholar
Gräsbeck R, editor. Reference values. Newsletter IFCC 1977;6:4–11 Google Scholar
Gräsbeck R, Alström T, editors. Reference values in laboratory medicine. Chichester, UK: John Wiley & Sons, 1981:413. Google Scholar
International Federation of Clinical Chemistry, Committee on Standards, Expert Panel on the Theory of Reference Values, Gräsbeck R, Siest G, Wilding P, et al. Provisional recommendation on the theory of reference values. Clin Chim Acta 1978;87:459F–65F, J Clin Chem Clin Biochem 1979;17:337–9. Google Scholar
Gräsbeck R, Solberg HE. Can reference values be properly used? Clin Chem 1981;27:1795–6. Google Scholar
Gräsbeck R. The evolution of the reference value concept. Clin Chem Lab Med 2004;42:119–23. Google Scholar
Siest G, editor. Reference values in human chemistry. Effects of analytical and individual variations, food intake, drugs and toxics – Applications in preventive medicine. Proceedings of the Second International Colloquium “Automatisation and Prospective Biology”, Pont-à-Mousson, October 10–14, 1972. Basel: Karger, 1973:347. Google Scholar
Solberg HE. International Federation of Clinical Chemistry (IFCC), Scientific Committee, Clinical Section, Expert Panel on Theory of Reference Values, and International Committee for Standardization in Haematology (ICSH), Standing Committee on Reference Values. Approved Recommendation (1986) on the theory of reference values. Part 1. The concept of reference values. J Clin Chem Clin Biochem 1987;25:337–42. Google Scholar
PetitClerc C, Solberg HE. International Federation of Clinical Chemistry (IFCC), Scientific Committee, Clinical section, Expert panel on Theory of Reference values. Approved recommendation (1987). Part 2. Selection of individuals for the production of reference values. J Clin Chem Biochem 1987;25:639–44 Google Scholar
Solberg HE, PetitClerc C. International Federation of Clinical Chemistry (IFCC), Scientific Committee, Clinical Section, Expert Panel on Theory of Reference Values. Approved recommendation (1988) on the theory of reference values. Part 3. Preparation of individuals and collection of specimens for the production of reference values. J Clin Chem Clin Biochem 1988;26:593–8. Google Scholar
Solberg HE, Stamm D. International Federation of Clinical Chemistry, Scientific Division: approved recommendation on the theory of reference values. Part 4. Control of analytical variation in the production, transfer and application of reference values. Eur J Clin Chem Clin Biochem 1991;29:531–5. Google Scholar
Solberg HE. IFCC Approved recommendation (1987) on the theory of reference values. Part 5. Statistical treatment of collected reference values. Determination of reference limits. J Clin Chem Clin Biochem 1987;25:645–56. Google Scholar
Dybkaer R. Solberg HE. IFCC approved recommendation on the theory of reference values. Part 6. Presentation of observed values related to reference values. J Clin Chem Clin Biochem 1987;25:657–62. Google Scholar
Siest G, Vernet-Nyssen M. Document A. Le concept de valeurs de référence en biologie clinique. Ann Biol Clin 1981;39:381–4. Google Scholar
Gueguen R, Albert A. Document B. Langage et principes statistiques pour les valeurs de référence. Ann Biol Clin 1985;43:297–305.Google Scholar
Siest G, Henny J. Document G: Production des valeurs de référence. Ann Biol Clin 1981;39:235–44. Google Scholar
Sachs C. Annexes du document G: Critères de partition et d’exclusion, objectifs. Ann Biol Clin Chem 1984;42:243–5. Google Scholar
Drosdowsky M, Ramon-Bauza F. Annexes du document G: Critères de partition et d’exclusion, age, sexe, puberté, ménopause. Ann Biol Clin 1984;42:246–8. Google Scholar
Sachs C, Buret J. Annexes du document G: Critères de partition et d’exclusion, poids corporel, surcharge pondérale. Ann Biol Clin 1984;42:249–56. Google Scholar
Aellig A, Sachs C. Annexes du document G: Critères de partition et d’exclusion, facteurs d’environnement: alcool, tabac et mode de vie. Ann Biol Clin 1984;42:256–65. Google Scholar
Siest G, Vernet-Nyssen M. Document D. Variations biologiques des examens de laboratoire. Ann Biol Clin 1979;37:229–39. Google Scholar
Drosdowsky M, Sachs C. Document E. Facteurs à prendre en considération pour le prélèvement sanguin en vue de l’établissement de limites de référence. Ann Biol Clin 1980;38:251–62. Google Scholar
Buret J, Vernet-Nyssen M. Annexes du document E: bibliographie. Ann Biol Clin 1980;38:261–2. Google Scholar
Daubrosse-Panek E, Petitclerc C. Annexe du document E: recommandations générales. Ann Biol Clin 1980;30:263–5. Google Scholar
Bretaudière JP, Albert A. Document C. Influence des facteurs analytiques sur les valeurs de référence. Ann Biol Clin 1985;43:306–9. Google Scholar
Albert A, Gueguen G, Sachs C. Document H: Traitement des valeurs de référence et détermination de l’intervalle de référence. Ann Biol Clin 1983;41:63–79. Google Scholar
Albert A, Gueguen G, Sachs C. Document I: Présentation des valeurs observées par rapport aux valeurs de référence. Ann Biol Clin 1983;41:225–31. Google Scholar
Siest G, Henny J, Guize L, Sachs C. Document J: Utilisation des valeurs de référence. Ann Biol Clin 1982;40:697–708. Google Scholar
Queralto JM, Antoja F, Cortes M, Domenech MV, Fuentes J, Llagostera MJ, et al. Documento A: Concepto de valores de referencia en Quimica Clinica. Quimica Clinica 1983;2:39–41. Google Scholar
Queralto JM, Ribo A, Cortes M, Domenech MV, Ferrer P, Fuentes J, et al. Documento D: Produccion y utilizacion de valores de referencia. Quimica Clinica 1987;6:49–68. Google Scholar
Queralto JM, Antoja F, Cortes M, Domenech MV, Fuentes J, Llagostera MJ, et al. Documento C: Variaciones analiticas y extra-analyticas en la produccion de los valores de referencia. Quimica Clinica 1984;3:43–50. Google Scholar
Queralto JM, Antoja F, Cortes M, Domenech MV, Fuentes J, Llagostera MJ, et al. Documento B: Bases estadisticas de la teoria de valores de referencia. Quimica Clinica 1983;2:95–105. Google Scholar
Queralto JM, Ribo A, Cortes M, Domenech MV, Ferrer P, Fuentes J, et al. Documento E: Transferibilidada de valores de referencia. Quimica Clinica 1990;9:102–23. Google Scholar
Queralto JM, Antoja F, Cortes M, Domenech MV, Ferrer P, Fuentes J, et al. Documento F: Valores de referencia intraindividuales. Quimica Clinica 1991;10:56–8. Google Scholar
Alström T, Gräsbeck R, Lindblad B, Solberg HE, Winkel P, Viinikka L. Establishing reference values from adults: recommendation on procedures for the preparation of individuals, collection of blood, and handling and storage of specimens. Committee on Reference Values of the Scandinavian Society for Clinical Chemistry. Scand J Clin Lab Invest 1993;53:649–52. CrossrefGoogle Scholar
Munan L, Kelly A, Petitclerc C. Population serum urate levels and their correlates; the Sherbrooke regional study. Am J Epidemiol 1976;103:369–82. Google Scholar
Munan L. Population based reference data. In: Rubin M, Benson ES, editors. Logic and economics of clinical laboratory use. New York: Elsevier, 1978:117–28.Google Scholar
Munan L, Kelly A, Petitclerc C, Billon B. Association with body weight of selected chemical constituents in blood: epidemiological data. Clin Chem 1978;24:772–7.Google Scholar
Kelly A, Petitclerc C. Transferability of reference data. In: Gräsbeck R, Alström A, editors. Reference values in laboratory medicine. Chichester, UK: John Wiley & Sons Ltd, 1981: 177–92. Google Scholar
Williams GZ. Individuality of clinical biochemical patterns in preventive health maintenance. J Occup Med 1967;9:567–70. Google Scholar
Harris EK, DeMets DL. Effects of intra- and inter-individual variation on distributions of single measurements. Clin Chem 1972;18:244–9. Google Scholar
Harris EK. Effects of intra- and interindividual variation on the appropriate use of normal ranges. Clin Chem 1974;20:1535–42.Google Scholar
Harris EK. Some theory of reference values. I. Stratified (categorized) normal ranges and a method for following an individual’s clinical laboratory values. Clin Chem 1975;21:1457–64.Google Scholar
Harris EK. Some theory of reference values. II. Comparison of some statistical models of intraindividual variation in blood constituents. Clin Chem 1976;22:1343–50. Google Scholar
Harris EK, DeMets DL. Estimation of normal ranges and cumulative proportions by transforming observed distributions to gaussian form. Clin Chem 1972;18:605–12. Google Scholar
Harris EK. Statistical principles underlying analytic goal-setting in clinical chemistry. Am J Clin Pathol 1979;72(2 Suppl):374–82. Google Scholar
Harris EK. Proposed goals for analytical precision and accuracy in single-point diagnostic testing. Theoretical basis and comparison with data from College of American Pathologists proficiency surveys. Arch Pathol Lab Med 1988;112:416–20. Google Scholar
Williams GZ, Harris EK, Widdowson GM. Comparison of estimates of long-term analytical variation derived from subject samples and control serum. Clin Chem 1977;23:100–4. Google Scholar
Pickup JF, Harris EK, Kearns M, Brown SS. Intra-individual variation of some serum constituents and its relevance to population-based reference ranges. Clin Chem 1977;23:842–50. Google Scholar
Harris EK, Cooil BK, Shakarji G, Williams GZ. On the use of statistical models of within-person variation in long-term studies of healthy individuals. Clin Chem 1980;26:383–91. Google Scholar
Harris EK. Step-by-step algorithm for computing critical range of next observation based on previous time series. Clin Chem 1977;23:2179–80. Google Scholar
Harris EK, Brown SS. Temporal changes in the concentrations of serum constituents in healthy men. Distributions of within-person variances and their relevance to the interpretation of differences between successive measurements. Ann Clin Biochem 1979;16:169–76.Google Scholar
Harris EK. Reference values for change: an addendum. Clin Chem 1983;29:997.Google Scholar
Harris EK, Yasaka T. On the calculation of a “reference change” for comparing two consecutive measurements. Clin Chem 1983;29:25–30.Google Scholar
Queralto JM, Boyd JC, Harris EK. On the calculation of reference change values, with examples from a long-term study. Clin Chem 1993;39:1398–403. Google Scholar
Harris EK, Yasaka T, Horton MR, Shakarji G. Comparing multivariate and univariate subject-specific reference regions for blood constituents in healthy persons. Clin Chem 1982;28:422–6. Google Scholar
Harris EK, Boyd JC. On dividing reference data into subgroups to produce separate reference ranges. Clin Chem 1990;36:265–70. Google Scholar
Harris EK, Wong ET, Shaw ST, Jr. Statistical criteria for separate reference intervals: race and gender groups in creatine kinase. Clin Chem 1991;37:1580–2. Google Scholar
Harris EK. Statistical aspects of reference values in clinical pathology. Prog Clin Pathol 1981;8:45–66. Google Scholar
Harris EK, Yasaka T, editors. Maintaining a healthy state within the individual. Ifip-Imia Working Conference on Maintaining a Healthy State Within the Individual; 1986; Kobe, Japan: North Holland. Google Scholar
Harris EK, Boyd JC. Statistical bases of reference values in laboratory medicine. New York: M. Dekker, 1995. Google Scholar
Stanislas cohort List (see Supplementary Data, which accompanies the article at http://www.degruyter.com/view/j/cclm.2012.51.issue-1/issue-files/cclm.2013.50.issue-1.xml).
Vincent-Viry M, Schiele F, Gueguen R, Bohnet K, Visvikis S, Siest G. Biological variations and genetic reference values for apolipoprotein E serum concentrations: results from the Stanislas Cohort study. Clin Chem 1998;44:957–65. Google Scholar
Directive 98/79/CE. Brussels: European Community, 1998. Google Scholar
EN ISO 15189 (2007). Medical Laboratories – particular requirements for quality and competence. Geneva, ISO, 2007. Google Scholar
Fuentes-Arderiu X. Biological reference intervals and ISO 15189. Clin Chim Acta 2006;364:365–6. Google Scholar
Henny J, Petitclerc C, Fuentes-Arderiu X, Petersen PH, Queralto JM, Schiele F, et al. Need for revisiting the concept of reference values. Clin Chem Lab Med 2000;38:589–95. Google Scholar
Cruze-Carlos LM, Monge Azemar N, Fuentes-Arderiu X. Report of the II European symposium on clinical laboratory and in vitro diagnostics industry: “physiological reference values: a shared business”. Clin Chim Acta 2003;338:165–9. CrossrefGoogle Scholar
Petersen PH, Henny J, editors. Reference values. Special Issue, Clin Chem Lab Med 2004;42:685. Google Scholar
Thue-Ronn P, Blaabjerg O, Hyltoft Petersen P. Reference intervals for plasma proteins with non-insulin dependent diabetes mellitus. Upsala J Med Sci 1994;99:363–72. Google Scholar
Hyltoft Petersen P, Jensen EA, Brandslund I. Analytical performance, reference values and decision limits. A need to differentiate between reference intervals and decision limits and to define analytical quality specifications. Clin Chem Lab Med 2012;50:819–31. Google Scholar
Plebani M, editor. Reference values. Special Issue, Clin Chem Lab Med 2012;50:761. Google Scholar
Rustad P, Felding P, Lahti A. Proposal for guidelines to establish common biological reference intervals in large geographical areas for biochemical quantities measured frequently in serum and plasma. Clin Chem Lab Med 2004;42:783–91. Google Scholar
Rustad P, Felding P, Lahti A, Hyltoft Petersen P. Descriptive analytical data and consequences for calculation of common reference intervals in the Nordic Reference Interval project 2000. J Clin Lab Invest 2004;64:343–70.Google Scholar
Rustad P, Felding P, Franzson L, Kairisto V, Lahti A, Martensson A, et al. The Nordic Reference Interval Project 2000: recommended reference intervals for 25 common biochemical properties. J Clin Lab Invest 2004;64:271–84. Google Scholar
Ichihara K, Itoh T, Min WK, Yap SF, Lam CW, Kong XT, et al. Diagnostic and epidemiological implications of regional differences in serum concentrations of proteins observed in six Asian cities. Clin Chem Lab Med 2004;42:800–9. Google Scholar
Ichihara K, Itoh Y, Lam CW, Poon P, Kim JH, Kyono H, et al. Sources of variation of commonly measured serum analytes in 6 Asian cities and consideration of common reference intervals. Clin Chem 2008;54:356–65.CrossrefGoogle Scholar
Ruan C, Wu Y, Okada T, Motoi S, Kondo T, Jiang M, et al. Studies on reference intervals for platelet counts in three cities in China and one in Japan. Thromb Haemost 2002;88:111–4. Google Scholar
Fuentes-Arderiu X, Mas-Serra R, Aluma-Trullas A, Marti-Marcet MI, Dot-Bach D. Guideline for the production of multicentre physiological reference values using the same measurement system. A proposal of the Catalan Association for Clinical Laboratory Sciences. Clin Chem Lab Med 2004;42:778–82. Google Scholar
Fuentes-Arderiu X, Ferré-Masferrer M, Gonzalez-Alba JM, Escola-Aliberas J, Balsells-Rosello D, Blanco-Cristobal C, et al. Multicentric reference values for some quantities measured with Tina-quant reagents systems and RD/Hitachi analysers. J Clin Lab Invest 2001;61:273–6. Google Scholar
Fuentes-Arderiu X, Ferré-Masferrer M, Gonzalez-Alba JM, Villarino-Gonzales MI, Arrimadas-Esteban E, Cabrero-Olivé D, et al. Multicentric reference values for some quantities measured with Elecsys 2010 analyser. Clin Chim Acta 2001;304:143–6.CrossrefGoogle Scholar
Ferré-Masferrer M, Fuentes-Arderiu X, Gomà-Llonguers M, Alumà-Trulàs A, Aramendi-ramos M, Castano-Vidriales JL. Regional reference values for some quantities measured with the ADVIA Centaur analyser. A model of cooperation between the in vitro diagnostic industry and clinical laboratories. Clin Chem Lab Med 2001;39:166–9.Google Scholar
Fuentes-Arderiu X, Mas-Serra R, de-Nó-Lengaran C, Espejo-López F, Filgueiras-Martínez P, Gassiot-Cordomí P, et al. Multicentre physiological reference values for some urinary component-to-creatinine (creatininium) concentration ratios. Clin Chem Lab Med 2005;43:958–62.Google Scholar
Fuentes-Arderiu X, Alvarez-Funes V, Coca-Fábregas L, Cruz-Placer M, Díaz-Fernández J, Herrero-Bernal P, et al. Multicentre physiological reference values for the concentration of creatininium in plasma and diagnostic specificity of glomerular filtration rate estimated with the MDRD equation. Clin Chem Lab Med 2007;45:531–4.Google Scholar
Fuentes-Arderiu X, Alonso-Gregorio E, Alvarez-Funes V, Ambrós-Marigómez C, Coca-Fábregas L, Cruz-Placer M, et al. Multicentre physiological reference intervals for serum concentrations of immunoglobulins A, G and M, complement C3c and C4 measured with Tina-Quant reagents systems. Clin Chem Lab Med 2007;45:387–90. Google Scholar
Ceriotti F. Prerequisites for use of common reference intervals. Clin Biochem Rev 2007;28:115–21. Google Scholar
Haeckel R, Wosniok W, Arzideh F. A plea for intra-laboratory reference limits. Part 1. General considerations and concepts for determination. Clin Chem Lab Med 2007;45:1033–42. Google Scholar
Arzideh F, Wosniok W, Gurr E, Hinsch W, Schumann G, Weinstock N, et al. A plea for intra-laboratory reference limits. Part 2. A bimodal retrospective concept for determining reference limits from intra-laboratory databases demonstrated by catalytic activity concentrations of enzymes. Clin Chem Lab Med 2007;45:1043–57. Google Scholar
Arzideh F, Wosniok W, Haeckel R. Reference limits of plasma and serum creatinine concentrations from intra-laboratory data bases of several German and Italian medical centres: comparison between direct and indirect procedures. Clin Chim Acta 2010;411:215–21. CrossrefGoogle Scholar
Arzideh F, Wosniok W, Haeckel R. Indirect reference intervals of plasma and serum thyrotropin (TSH) concentrations from intra-laboratory data bases from several German and Italian medical centres. Clin Chem Lab Med 2011;49:659–64. Google Scholar
Kallner A, Gustavsson E, Hendig E. Can age and sex related reference intervals be derived for non-healthy and non-diseased individuals from results of measurements in primary health care? Clin Chem Lab Med 2000;38:633–54. Google Scholar
Lahti A, Hyltoft Petersen P, Boyd JC, Fraser CG, Jørgensen N. Objective criteria for partitioning Gaussian-distributed reference values into subgroups. Clin Chem 2002;48:338–52. Google Scholar
Lahti A, Hyltoft Petersen P, Boyd JC. Impact of subgroup prevalences on partitioning of Gaussian-distributed reference values. Clin Chem 2002;48:1987–99.Google Scholar
Lahti A. Partitioning biochemical reference data into subgroups: comparison of existing methods. Clin Chem Lab Med 2004;42:725–33. Google Scholar
Boyd JC. Reference regions of two or more dimensions. Clin Chem Lab Med 2004;42:739–46. Google Scholar
Ichihara K, Boyd JC; IFCC Committee on Reference Intervals and Decision Limits (C-RIDL). An appraisal of statistical procedures used in derivation of reference intervals. Clin Chem Lab Med 2010;48:1537–51. Google Scholar
Horn PS, Pesce AJ, Copeland BE. Reference interval computation using robust vs parametric and nonparametric analyses. Clin Chem 1999;45:2284–5. Google Scholar
Horn PS, Pesce AJ, Copeland BE. A robust approach to reference interval estimation and evaluation. Clin Chem 1998;44:622–31. Google Scholar
Horn PS, Feng L, Li Y, Pesce AJ. Effect of outliers and non-healthy individuals on reference interval estimation. Clin Chem 2001;47:2137–45. Google Scholar
Panteghini M, Ceriotti F. Obtaining reference intervals traceable to reference measurement systems: is it possible, who is responsible, what is the strategy? Clin Chem Lab Med 2012;50:813–7. Google Scholar
Steinmetz J, Schiele F, Gueguen R, Férard G, Henny J; Periodic Health Examination Centers Laboratory Working Group. Standardization of gamma-glutamyltransferase assays by intermethod calibration. Effect on determining common reference limits. Clin Chem Lab Med 2007;45: 1373–80. Google Scholar
National Committee for Clinical Laboratories Standards. How to define and determine reference intervals in the clinical laboratory: approved guideline, NCCLS document C28-A. Wayne, PA: National Committee for Clinical Laboratories Standards, 1995;15:59. Google Scholar
Clinical and Laboratory Standards Institute. How to define and determine reference intervals in the clinical laboratory: approved guideline, 2nd ed. CLSI document C28-A2. Wayne, PA: Clinical and Laboratory Standards Institute, 2000. Google Scholar
Clinical and Laboratory Standards Institute. Defining, establishing and verifying reference intervals in the clinical laboratory; approved guidelines, 3rd ed. CLSI document C28-A3. Wayne, PA: Clinical and Laboratory Standards Institute, 2008;28:61. Google Scholar
Henny J. The IFCC recommendations for determining reference intervals: strengths and limitations. J Lab Med 2009;33:45–51. Google Scholar
Ceriotti F, Boyd JC, Klein G, Henny J, Queraltó J, Kairisto V, et al. Reference intervals for serum creatinine concentrations: assessment of available data for global application. Clin Chem 2008;54:559–66. CrossrefGoogle Scholar
Ceriotti F, Henny J, Queraltó J, Ziyu S, Özarda Y, Chen B, et al. Common reference intervals for aspartate aminotransferase (AST), alanine aminotransferase (ALT) and γ-glutamyl transferase (GGT) in serum: results from an IFCC multicenter study. Clin Chem Lab Med 2010;48:1593–601. Google Scholar
Ceriotti F, Henny J. “Are my laboratory results normal?” Considerations to be made concerning Reference Intervals and Decision Limits. Available from: http://www.ifcc.org/ifcc-communications-publications-division-%28cpd%29/ifcc-publications/ejifcc-%28journal%29/e-journal-volumes/vol-19-n%C2%B0-2/are-my-laboratory-results-normal-considerations-to-be-made-concerning-reference-intervals-and-de/. Accessed 7 September, 2012 .
Haeckel R, Wosniok W, Arzideh F. Proposed classification of various limit values (guide values) used in assisting the interpretation of quantitative laboratory test results. Clin Chem Lab Med 2009;47:494–7. Google Scholar
Horn PS, Pesce AJ. Effect of ethnicity on reference intervals. Clin Chem 2002;48:1802–4. Google Scholar
Johnson AM, Hyltoft Petersen P, Whicher JT, Carlström A, MacLennan S; International Federation of Clinical Chemistry and Laboratory Medicine, Committee on Plasma Proteins. Reference intervals for serum proteins: similarities and differences between adult Caucasian and Asian Indian males in Yorkshire, UK. Clin Chem Lab Med 2004;42: 792–9. Google Scholar
Lepage N, Huang SH, Nieuwenhuys E, Filler G. Pediatric reference intervals for immunoglobulin G and its subclasses with Siemens immunonephelometric assays. Clin Biochem 2010;43:694–6. CrossrefGoogle Scholar
Henderson MP, Grey V. Establishing and evaluating pediatric thyroid reference intervals on the Roche Modular Analytics E 170 using computational statistics and data-mining techniques. Clin Biochem 2011;44:767–70.Google Scholar
Langer C, Tambyrayah B, Thedieck S, Nowak-Göttl U. Testing for lipoprotein(a) concentration and apolipoprotein(a) phenotypes: method standardization and pediatric reference values. Semin Thromb Hemost 2011;37:810–3.CrossrefGoogle Scholar
Christensen RD, Baer VL, Gordon PV, Henry E, Whitaker C, Andres RL, et al. Reference ranges for lymphocyte counts of neonates: associations between abnormal counts and outcomes. Pediatrics 2012;129:e1165–72. CrossrefGoogle Scholar
Chan MK, Seiden-Long I, Aytekin M, Quinn F, Ravalico T, Ambruster D, et al. Canadian Laboratory Initiative on Pediatric Reference Interval Database (CALIPER): pediatric reference intervals for an integrated clinical chemistry and immunoassay analyzer, Abbott ARCHITECT ci8200. Clin Biochem 2009;42:885–91. Google Scholar
Blasutig IM, Jung B, Kulasingam V, Baradaran S, Chen Y, Chan MK, et al. Analytical evaluation of the VITROS 5600 Integrated System in a pediatric setting and determination of pediatric reference intervals. Clin Biochem 2010;43: 1039–44.Google Scholar
Kulasingam V, Jung BP, Blasutig IM, Baradaran S, Chan MK, Aytekin M, et al. Pediatric reference intervals for 28 chemistries and immunoassays on the Roche cobas 6000 analyzer – a CALIPER pilot study. Clin Biochem 2010;43:1045–50.CrossrefGoogle Scholar
Colantonio DA, Kyriakopoulou L, Chan MK, Daly CH, Brinc D, Venner AA, et al. Closing the gaps in pediatric laboratory reference intervals: a CALIPER database of 40 biochemical markers in a healthy and multiethnic population of children. Clin Chem 2012;58:854–68. CrossrefGoogle Scholar
Nilsson SE, Evrin PE, Tryding N, Berg S, McClearn G, Johansson B. Biochemical values in persons older than 82 years of age: report from a population-based study of twins. Scand J Clin Lab Invest 2003;63:1–13. CrossrefGoogle Scholar
Huber KR, Mostafaie N, Stangl G, Worofka B, Kittl E, Hofmann J, et al. Clinical chemistry reference values for 75-year-old apparently healthy persons. Clin Chem Lab Med 2006;44:1355–60.Google Scholar
Millán-Calenti JC, Sánchez A, Lorenzo-López L, Maseda A. Laboratory values in a Spanish population of older adults: a comparison with reference values from younger adults. Maturitas 2012;71:396–401. CrossrefGoogle Scholar
Gilbert RM, Hadlow NC, Walsh JP, Fletcher SJ, Brown SJ, Stuckey BG, et al. Assessment of thyroid function during pregnancy: first-trimester (weeks 9–13) reference intervals derived from Western Australian women. Med J Aust 2008;189:250–3. Google Scholar
Klajnbard A, Szecsi PB, Colov NP, Andersen MR, Jørgensen M, Bjørngaard B, et al. Laboratory reference intervals during pregnancy, delivery and the early postpartum period. Clin Chem Lab Med 2010;48:237–48. Google Scholar
Shahabi P, Siest G, Herbeth B, Ndiaye NC, Visvikis-Siest S. Clinical necessity of partitioning of human plasma haptoglobin reference intervals by recently-discovered rs2000999. Clin Chim Acta 2012;413:1618–24. CrossrefGoogle Scholar
Geffré A, Grollier S, Hanot C, Vergez F, Trumel C, Braun JP. Canine reference intervals for coagulation markers using the STA Satellite and the STA-R Evolution analyzers. J Vet Diagn Invest 2010;22:690–5.CrossrefGoogle Scholar
Bourgès-Abella N, Geffré A, Concordet D, Braun JP, Trumel C. Canine reference intervals for the Sysmex XT-2000iV hematology analyzer. Vet Clin Pathol 2011;40:303–15. Google Scholar
Rustad P, Simonsson P, Felding P, Pedersen M. Nordic Reference Interval Project Bio-bank and Database (NOBIDA): a source for future estimation and retrospective evaluation of reference intervals. Scand J Clin Lab Invest 2004;64:431–8. CrossrefGoogle Scholar
Fraser CG. Reference change values. Clin Chem Lab Med 2012;50:807–12. Google Scholar
American Association for Clinical Chemistry (AACC). Lab Tests Online. Available from: http://labtestsonline.org/understanding/features/ref-ranges/. Accessed 10 September, 2012.
About the article
Prof. Gérard Siest received his pharmacy diploma, his specialization in laboratory medicine and his PhD from the Universities of Strasbourg and Nancy. He was Professor of Biochemistry, Molecular Biology and Molecular Pharmacology and in charge of the direction of the Center for Preventive Medicine laboratory and some years later all the research of this health screening organization. He developed a research team linked to CNRS and INSERM dealing with drug metabolism, more particularly the UDPGTs and on the genetic influence on laboratory tests and on reference values. Two proteins were studied more deeply: apolipoprotein E and gammaglutamylstransferase. He created and was in charge of the regional postgraduate course on drug metabolism and biochemical pharmacology. He was President of the European Society of Biochemical Pharmacology and of the International Federation of Clinical Chemistry (IFCC) for 6 years. Gérard Siest was Editor-in-Chief of Clinical Chemistry and Laboratory Medicine (CCLM) from 1998 until 2008. In 2010, he founded a new European Society of Pharmacogenetics and Theranostics (ESPT) for which he is now President and starting to be Editor-in-Chief of the journal Drug Metabolism and Drug Interactions (DMDI) that is also the official journal of ESPT. In addition, he is involved in many boards of pharmacogenomics and laboratory medicine journals. He has published over 600 peer review publications. The scientific conferences organized every 2 years, in Santorini (Greece) under his responsibility are very successful. The last one was held from 30 September to 2 October 2012.
Joseph Henny pursued his studies at the University of Lorraine (formerly Nancy). Then he served as director of the clinical laboratory at the Centre for Preventive Medicine in Nancy. Since 2009, he is responsible for the constitution of a Biobank devoted to large population studies related to the ‘Constances Cohort Project’ launched by UMR INSERM 1018 (Centre for Research in Epidemiology and Population Health) – Versailles Saint-Quentin University. He is also in charge of the scientific and technical coordination of the Medical Laboratories Network of the French Health Periodic Examination Centres. He is participating in several regional and national programs in the domains of Public Health in collaboration with regional and/or national health operators (such as French Institute for Public Health Surveillance). He serves as reviewer for various journals. His scientific interest is primarily in the field of the study of the effects of biological variations on laboratory tests, he is working on development of reference values and related concepts. Further, he has been an active participant in numerous studies seeking to better understand biologic and pre-analytical variability, including standardization of laboratory tests. Of note, since the 1980s Joseph Henny has been actively involved in the Société Française de Biologie Clinique (SFBC). He has also occupied the post of treasurer and general secretary of the SFBC. He is an active member of several national scientific societies and member of several working groups and committee at national and international level. He was a full member of the Reference Intervals and Decision Limits Committee (C-RIDL) of the International Federation of Clinical Chemistry and Laboratory Medicine (IFCC-LM).
Published Online: 2012-11-24
Published in Print: 2013-01-01