Improving homeopathic prescribing by applying epidemiological techniques: the role of likelihood ratio

CF Stolper1, ALB Rutten2, RFG Lugten3, RJWM Barthels4
Commissie Methode en Validering VHAN (Dutch Association of Homeopathic Physicians); 1 Heerde, Netherlands; 2 Breda, Netherlands; 3 Gouda, Netherlands; 4 Drunen, Netherlands

A committee of the Dutch association of homeopathic physicians is trying to validate materia medica by evaluating successful cases. These cases are presented and assessed by a group of experienced homeopathic physicians to provide indications about the prevalence of symptoms related to particular homeopathic medicines. The next logical question is whether epidemiological techniques can be applied to them. We have some ideas concerning the information these data can provide, based on epidemiological theory and limited experimental data. Theoretical investigation suggests that the epidemiological concept of likelihood ratio is well adapted to homeopathy. Researching and applying likelihood ratio will lead to more accurate materia medica and repertory. These considerations already indicate some shortcomings in the representation of rare medicines and the use of grading in the current repertories. Homeopathy 91, 230-238

Introduction

Some epidemiological concepts

First we will give an outline of the opportunities offered to homeopathy by epidemiology. In conventional medicine we are used to assessing diagnostic tests. To assess a diagnostic test we need a gold standard to compare the test with.(1) The gold standard is regarded as the best approximation of the truth. For instance, to assess ultrasonography for diagnosis of gallstones the best standard is the result of laparotomy. After laparotomy we can divide patients in four groups, according to outcome:

a: the test (ultrasonography) is positive and the illness (gallstones) is present: true positive

b: the test is positive and the illness is absent: false positive

c: the test is negative and the illness is present: false negative

d: the test is negative and the illness is absent: true negative

This can be depicted in a 2x2 table (Table 1):

 

illness present

illness absent

 

test positive

a

b

a+b

test negative

c

d

c+d

 

a+c

b+d

a+b+c+d

Table 1: 2x2 table showing relationships between the results of diagnostic tests and the presence of illness

We will use the notation a-d to indicate the possible results of a test.

These values can be assessed for every possible test. For example, we can assess the effectiveness of a drug-detecting dog in a group of travellers where we know who is carrying drugs and who is not (Table 2).

 

Drugs

No drugs

 

Barks

95(a)

50(b)

145

Does not bark

5(c)

50(d)

55

 

100

100

200

Table 2: assessing a test: the accuracy of a sniffer dog in detecting drugs

These tables imply statistical terms like specificity and sensitivity. These can be confusing and we do not need hem, as you will see.

After testing the dog we want to use him. If the dog barks we can estimate the chance that the traveller carries drugs; this is the positive diagnostic value (DV+). If the same dog does not bark how great is the chance that we let the traveller rightfully pass? We define this as the negative diagnostic value (DV-).

The mathematical notation according to the 2x2 table is:

DV+ = a/(a+b)

DV- = d/(c+d)

If, as in the situation where we tested the dog, half of the passengers are carrying drugs, the positive diagnostic value of the dog barking is 95/145 = 66%. If the dog does not bark the chance that the traveller is not carrying drugs is 50/55 = 91%.

What happens if the circumstances are different, e.g. only 50 out of 200 travellers are carrying drugs? The 2x2 table for the same dog is given in Table 3:

 

Drugs

No drugs

 

Barks

47(a)

75(b)

122

Does not bark

3(c)

75(d)

78

 

50

150

200

Table 3: Accuracy of sniffer dogs with lower incidence of carrying drugs

In this case the chance of anyone carrying drugs when the dog barks is 47/122 = 39%. If the dogs does not bark the chance of the passenger being clean is 75/78 = 96%. Therefore this dog is not suited for a situation where the probability of a traveller carrying drugs is low, because there are too many false alarms (75 false alarms, only three false negatives).

The usefulness of a test depends on the situation, i.e. the chance of a fact being present before the test is performed (prior-chance). Some tests perform well if the prior-chance is low and badly if the prior-chance is high or vice versa.

In a homeopathic consultation we have a similar situation. We are testing the possibility of a medicine by asking question indicating that medicine. At the beginning of a consultation many medicines are possible, so the prior-chance of a medicine is low. If, at a later stage, our differential diagnosis is confined to, say, four medicines the prior-chance is 25%. So the same question can have a different diagnostic value in different stages of the consultation.

Likelihood ratio

The fact that the diagnostic value is not a constant value but depending on the prior-chance is awkward when we try to compare tests. Therefore, we use another value: the likelihood ratio(LR).(2) The likelihood ratio is a constant that indicates the relation between prior-chance and posterior-chance (the chance after the test). This relation is given by a somewhat complicated formula:

Posterior-chance (+) = (((prior-chance/(1 – prior-chance)) x LR(+)) / (((prior-chance/(1-prior-change))xLR + 1).

Let us leave this formula and concentrate on what is useful for us. The LR indicates how the result of a test changes the possibility of a diagnosis in the light of what was already known. This is also known as ‘Bayes’ theorem’. The LR can indicate the change if the test is positive (defined as LR+) and if the test is negative (defined as LR-). The mathematical formulas regarding the parameters in the 2x2 table are:

LR+ = (a/(a+c)) / (b/(b+d))

LR- = (c/(a+c)) / (d/(b+d))

If LR = 1 nothing changes. The higher the LR(+) the better the test if the result is positive. For a negative result the test is better if the LR(-) is closer to zero. As an example we take the ultrasonography for the diagnosis of appendicitis. Literature(3) shows that the LR(+) = 7.6 and the LR(-) = 0.27. Now we can calculate the posterior-chance for all prior-chances using the formula mentioned above. This is graphically represented in Figure 1.


Figure 1 Ultrasonography and appendicitis: LR+ = 7.6; LR- = 0.27

This graph shows how the probability of appendicitis changes after positive (L+) and after negative (L-) ultrasonography. If our suspicion of the existence of appendicitis was 33%, the probability after the test rises to 79%, a negative ultrasonography would lower the probability to 12%.

Likelihood ratio and homeopathy

There are differences between diagnostic tests like ultrasonography and the questions used in homeopathy. The most important problem is the lack of a ‘gold standard’ such as the visible presence of gallstones. There is a somewhat vague general understanding about the meaning of ‘cure’. One of the main principles in our method is the relation between a symptom and the curative effect of a medicine. The symptom indicates that a medicine is more likely to have an effect than we could expect it by mere coincidence. The ‘gold standard’ in homeopathy is the fact that the medicine worked. Instead of the diagnostic value of a test we measure the prognostic value of a symptom.

The 2x2 diagram for a homeopathic symptom is shown in Table 4:

 

medicine worked

rest

 

symptom present

a

b

a+b

symptom absent

c

d

c+d

 

a+c

b+d

a+b+c+d

Table 4: In this table:
a+c = all the patients that got the medicine with a positive effect
b+d = all the other patients, including the ones that got the medicine without positive effect

Questions as diagnostic instruments

In homeopathy we ask lots of questions to get ideas about possible medicines or to confirm medicines that might be applicable. Some questions seem more effective than others, and some symptoms seem related to each other forming patterns and rendering the combination of these symptoms more important than the sum of the individual symptoms. In this respect homeopathy does not differ from conventional medicine. In the conventional medical diagnostic process, instruments are important, but interrogation is still important to narrow down the differential diagnosis. Can we assess questions the same way we assess diagnostic tests?(4)

At the moment our questions are based on our materia medica that is based mainly on opinion and clinical reasoning. Heuristics and biases trouble this kind of information.(5) If possible we want to update opinion-based information with clinical evidence.

Since several years a group of Dutch homeopathic physicians is meeting on a regular basis to validate materia medica. First we reached agreement upon the quality of cases using a specification of the Glasgow Homeopathic Scale.(6) The participants bring their cases involving a particular homeopathic medicine. These cases are discussed and, if approved, the symptoms of each case are gathered in a database. In our experience less than 10% of all cases of each practitioner reach the score 3 or 4 on this scale. This way we hope to assess the prevalence of the symptoms regarding our medicines. At the moment numbers are small, but we hope that our enthusiasm will stimulate our colleagues to increase these numbers. Now we have gathered a limited amount of data from which we want to draw preliminary conclusions. But first some theoretical considerations.

We take the symptom as the diagnostic test and the medicine as the illness to be diagnosed. If we take a closer look at the formula for the LR(+) we see something very interesting:

a/(a+c) is for the prevalence of the symptom in the population that responds to a medicine.

b/(b+d) is for the prevalence of the symptom in the population that does not respond to that medicine.

So the LR(+) = (a/(a+c))) / (b/(b+d)) = (prevalence of the symptom with the medicine) / (prevalence of the symptom with all others). Or in words: The likelihoodratio (+) of a symptom compares the presence of that symptom in the successful prescriptions of that medicine, with the frequency of this symptom in the unsuccessful prescription of the same medicine and all prescriptions of other medicines.

If the symptom is more frequently present where the medicine was successful than in the rest of the population the LR(+)>1. In other words the more the symptom is confined to the medicine (and not to the rest) the higher the likelihood ratio(+). In fact the likelihood ratio(+) is a mathematical representation of §153 of the Organon by Samuel Hahnemann that states: pay particular attention to the peculiar and characteristic symptoms.

The graph representing a peculiar symptom, usually with a high LR(+) and a moderate LR(-), could look like Figure 2.


Figure 2 The prognostic value of a peculiar or characteristic symptom LR+=40; LR-=0.6

An old rule can be translated into modern terminology, and there are several advantages in doing so. The most important advantage is a more accurate and quantitative description of the importance of a symptom, based on empirical data. This implies of course the gathering of the necessary data.

‘Loquacity’ and Lachesis

One of the medicines we validated is Lachesis.(7) Sixteen cases were assessed and labelled with a 3-4 score on the GHHOS scale. Here we show only some of the reported symptoms:

Inflammations

6

Dreams about snakes

1

Hormonal complaints

3

Loses his way

1

Loquacity

7

Swallowing <

1

Jealousy

3

Irritable on waking

1

Left-sided complaints

3

Right-sided complaints

1

Of course, these small numbers allow no definite conclusions. All the symptoms that were recorded only once could be present by chance. Their prevalence could vary widely around 6%. There are also some vague symptoms like ‘Inflammations’ and ‘Hormonal complaints’.

Despite all these shortcomings we want to show how a likelihood ratio for a homeopathic symptom could be presented. The prevalence of the symptom ‘Loquacity’ in the successful prescriptions is 44% (95% Confidence Interval: 21.1-78.9). We estimate (based on expert opinion) the prevalence of ‘Loquacity’ in the rest of the population to be 10%. Assuming that these values were confirmed with large groups the LR(+) would be 4.38 and LR(-)=0.63 (see Figure 3).


Figure 3 Loquacity and Lachesis LR+=4.38; LR-=0.63

In this graph we see a LR lower than for ultrasonography and appendicitis so the symptom ‘Loquacity’ gives less confidence for Lachesis than ultrasonography does for appendicitis. This is as expected because we know that only loquacity is not enough to prescribe Lachesis. But if we had to choose between three medicines (prior-chance=0.33) the posterior-chance for Lachesis rises to 0.69 is the patient is loquacious. If the symptom is negative the posterior-chance lowers to 0.24. Therefore this symptom is not appropriate to exclude Lachesis.

Pitfalls and limitations

If we want to base homeopathy on accurate and quantitative data we have to be sure that our algorithm is correct, to paraphrase Disraeli ‘There are lies, damned lies and epidemiology’. If theory and experience in practice do not match something is wrong. It might be the formula or the data that is wrong. At present it is not possible to test the usefulness of the LR in homeopathy, Which is mainly because we have no data about the prevalence of our symptoms in our patient population, and we are just starting to gather data about successful cases. The prevalence of a symptom in patients responding to a particular homeopathic medicine will be determined by collecting a sufficient number of cases that showed a curative effect from that medicine. But even then we must be careful. Homeopathic symptoms are not easy to define, nor are homeopathic effects. Also, present materia medica and repertory are not very clear about the exact meaning of many symptoms.

Vagueness

There are different kinds of vagueness in homeopathic symptoms. First of all there are semantics, e.g. what is ‘jealousy’? Then there are different grades of intensity. Take, for instance, headache, mild headache can be a symptom in almost any medicine. If we draw a 2x2 table for this symptom and the role of Belladonna in this respect we might get something like:

 

Belladonna

Not Belladonna

 

Headache

9

791

800

No headache

1

199

200

 

10

990

1000

Table 5 Belladonna and mild headache. LR+=1.13

The likelihood ratio (+) = 1.13 in this case.

But if we could define a severe kind of headache that has a low prevalence in the general population, but a high prevalence in the ‘Belladonna-population’, the 2x2 table could be like:

 

Belladonna

Not Belladonna

 

Headache

2

8

10

No headache

8

982

990

 

10

990

1000

Table 6 Belladonna and severe headache. LR+=25

In this case the likelihood ratio (+)=25. This seems consistent with the fact that in self-medication Belladonna appears to be a frequently applied medicine for headache.

The other kind of vagueness is the result of the medicine. There is no dichotomy like cure or no-cure. In the GHHOS-scale we distinct four grades of success, but is not yet validated. During our meetings we frequently have different opinions about the result in a particular case and about the causal relation between cure and medicine. These matters have to be investigated further.

Interpretation

The way symptoms are assessed by homeopathic physicians needs more investigation. A homeopathic physician will base symptoms like ‘Loquacity’, ‘Jealousy’ and ‘Desire for coffee’ not only on the response of the patient to these questions, but also on his own observations. If the complaint started after birth of a brother or sister he will consider ‘Jealousy’ despite the denials. The symptom ‘Loquacity’ will frequently be based on observations during the consultation. ‘Desire for coffee’ is related to various sociological circumstances. During our meetings we frequently hear remarks like "I took this symptom as important because it is rare in kind (or degree) in these circumstances".

In our example of ‘Loquacity’ indicating Lachesis we used retrospective data. This kind of data is liable to confirmation bias: when Lachesis has worked ‘Loquacity’ can easily be recognised by hindsight. Prospective gathering of data would be more reliable.

Rare medicines; typeface, prevalence and likelihood

The importance of a symptom in relation to a certain medicine is represented by the typefaces in homeopathic repertories. Bold type or bold and underlined, represent the most important symptoms of medicines. There are, however, some inconsistencies in the repertory. One is the representation of rare medicines. Changing typefaces on the basis of LR and power of the argument could correct this shortcoming.

The meaning of the typefaces in the repertory is not very clear. Kent gives no explanation in the preface to his repertory;(8) he merely explains what kind of symptoms is more valuable. We find some explanation in one of the sources of the repertory, Hering’s ‘Guiding symptoms’.(9) Hering gives indications for: ‘Symptoms occasionally confirmed’, ‘Symptoms more frequently confirmed’, ‘Symptoms verified by cures’ and ‘Symptoms repeatedly verified’. When a symptom is frequently confirmed in the treatment with a certain medicine its becomes more important, especially when the symptom is rare.

Rare medicines are medicines with little data. If there is little experience with a medicine, its symptoms are not frequently confirmed. This means that there is no emphasis for these symptoms in the repertory, even if the symptoms are characteristic for the medicine. As an example, consider the rubric "Fear of death" in the repertory of RADAR-synthesis (v. 7.2). This rubric contains 146 medicines, among them Aconitum (bold and underlined), Latrodectus mactans (plain) and Sulphur (plain). In this repertory Aconitum has 4376 symptoms, Latrodectus mactans has 109 symptoms and Sulphur has 11451 symptoms.

In Clarke’s materia medica,(10) ‘Fear of death’ is an important symptom for Aconitum. For Sulphur ‘Fear of death’ is not mentioned. For Latrodectus mactans ‘Fear of death’ is the most important symptom.

Latrodectus mactans

Latrodectus is a difficult medicine to handle because of the few data known for this medicine. Personal experience led to a surprising effect on a patient with cardiac problems. Several medicines had a moderate effect. Because of the persistent radiating pain from the heart to the axilla. Latrodectus was tried with good result. Aconitum had no effect on this patient; the second best medicine was Lycopodium. The fear of death in this patient was deep but not easily expressed. Since then we have used Latrodectus in cases with heart problems (esp. with pain radiating to the shoulder or axilla) and fear of death more frequently and with more success than Aconitum and sometimes with deep ‘constitutional’ effect.

This kind of experience is not uncommon for experienced colleagues. Maybe the only reason that a medicine is rarely prescribed is lack of knowledge. The more a medicine is used the greater the knowledge about it. The chance of finding a medicine by the use of the repertory is proportional to the amount of symptoms in the repertory. The chance of finding Aconitum is 40 times greater than the chance of finding Latrodectus. More use of these rare medicines seems justified.

If a medicine is seldom used there is not only the handicap of few recorded symptoms. The emphasis implied by typeface is also absent because of the infrequent use. We might suspect that ‘Fear of death’ is at least as important for Latrodectus as for Aconitum, in spite of the difference in typeface in the rubric. Let us investigate the advantage of using the LR instead of typeface. We assume that Latrodectus is prescribed once in about thousand prescriptions and that the prevalence of the symptom "Fear of death" is present in 60% of the cases where Latrodectus acted. Furthermore we assume that the prevalence ‘Fear of death’ is 10% in the rest of the population. The LR(+) of ‘Fear of death’ for Latrodectus is 6:

 

Latrodectus mactans

Other medicines

 

Fear death

6(a)

1000(b)

1006

no fear death

4(c)

9000(d)

9004

 

10

10000

10010

If we do the same for Aconitum, we assume that the prevalence of ‘Fear of death’ in this medicine is also 60%. Assuming that it is prescribed once in about every hundred prescriptions the figures in the 2x2 table are different (see appendix), but the LR(+) for ‘Fear of death’ of Aconitum is also 6:

 

Aconitum

Other medicines

 

Fear death

6(a)

100(b)

106

no fear death

4(c)

900(d)

904

 

10

1000

1010

In a former paragraph we showed the graphical representation of the likelihood ratio. The likelihood graph for this symptom with Aconitum and Latrodectus is like Figure 4:


Figure 4 Fear of death with Aconitum and Latrodectus mactans (estinated) LR+=6; LR-=0.44

We can do this for Sulphur assuming that the prevalence of this symptom is the same as in the rest of the population and that Sulphur is prescribed once in every 50 prescriptions. The likelihood ratio (+) for ‘Fear of death’ for Sulphur is 1 (but it might even be less than 1):

 

Sulphur

Other medicines

 

Fear death

1(a)

49(b)

122

no fear death

9(c)

441(d)

78

 

10

490

500

In the cases of Aconitum and Latrodectus we see a prior-chance of 16% climb to a posterior-chance of 53% after the symptom ‘Fear of death’ (see graph in the appendix). In the case of Sulphur the posterior-chance will remain 16%.

When we rely on our materia medica and when we have experience with a rare medicine, in this case Latrodectus, we will prescribe Latrodectus as many times as Aconitum, maybe even more. If we don’t have this experience and rely on the repertory we will frequently prescribe other medicines, like Sulphur, instead of Latrodectus in a case with ‘Fear of death’ despite the fact that this symptom is no clue for Sulphur. Of course these figures are estimated and we need to investigate them furthermore.

Repertory and likelihood ratio

We have shown that the original set-up of the repertory causes diagnostic problems for small medicines that can be solved by the use of LRs. But we need to look more closely at the content of symptom rubrics. We must reconsider the layout of the repertory in the light of LRs.

Misleading entries

If we consider a symptom rubric in the repertory the meaning of each medicine-entrance remains unclear. As we saw in the previous section a plain-type entry of a small medicine can stand for an important symptom. For the same reasons, it is possible that the entry of a large medicine is not justified.

Suppose we have half a million people cured by a homeopathic medicine and that, in this population, we have seen symptoms that occur in 80%, 60%, 40%, 20%, 10%, 1% and 0.1%. In this population Sulphur has acted in 10,000 cases, Aconitum in 4,000 cases and Lat-m. in 100 cases:

   

80%

symptom

60%

symptom

40%

symptom

20%

symptom

10%

symptom

1%

symptom

0.1%

symptom

population

500,000

400,000

300,000

200,000

100,000

50,000

5,000

500

sulphur

10,000

8,000

6,000

4,000

2,000

1,000

100

10

aconitum

4,000

3,200

2,400

1,600

800

400

40

4

Lat-.m

100

80

60

40

20

10

1

 

Of course these are different symptoms, a 20% symptom for Sulphur is different from a 20% symptom for Aconitum. If we remain in the same column of this table the LR = 1: When the occurrence of a symptom in the whole population is 5,000 (1%) the LR =1 for Lat-m. if the same symptom would occur once in the Lat-m. population (prevalence in the Lat-m.-population=1%). For Aconitum the absolute occurrence has to be 40 to obtain LR+=1 and for Sulphur the occurrence has to be 100. Here, again, we see the difficulty of concepts like ‘symptom frequently confirmed’. Symptoms will be more frequently confirmed for more frequently prescribed medicines.

If the occurrence of a symptom with Aconitum is 20% and in the whole population 10%, LR+=2.

If in this population ‘occasionally seen’ means 100 cases of Sulphur the LR=1 when the symptom has a prevalence of 1% in the whole population and LR+<1 when the symptom has a greater prevalence in the whole population.

Large rubrics (= rubrics with many medicines) stand for symptoms of high prevalence. If such a symptom is occasionally seen in the population of a large medicine like Sulphur the prevalence in the Sulphur-population is lower than the prevalence in the general population. Therefore it is likely that plain-type entries of a medicine with many symptoms in a large rubric stand for a LR=<1. This means that these symptoms cannot be used to include the medicine in our considerations. It does not mean that we can exclude it either! It is hard to define the size of rubrics where we can expect this, but possibly we should discard the plain-type entries of the 20 largest medicines in the 10% largest rubrics.

On the basis of our theoretical analysis we believe that it is possible to translate the qualitative concept ‘important symptom according to §153’ into the quantitative epidemiological concept ‘symptom with likelihood ratio of ..’. We have found some experimental evidence to confirm this idea.

We need to study the consequences for the repertory and our methodology.

We require more accurate data to compare prevalence of a symptom associated with a medicine with the prevalence of the symptom in the rest of the population. The best way to assess the prevalence of a symptom in successful prescriptions is to perform prospective studies, but we would need to do this for an enormous number of symptoms. This means significant organisational problems. Surveys to assess the presence of symptoms in the general population can also be used for the comparison between the medicine and the rest of the population. The design of the prospective study must provide the least possible interference with day-to-day practice.

The weight of the argument

‘The diagnostic ability of a doctor is also based on his capacity to estimate diagnostic data in a correct way’.(11) This is also relevant for the homeopathic prescribers because the value of the symptoms has an important role in the process of finding the similimum. Research into the way doctors estimate symptoms can be done by examination of the process of diagnostic decision making. LR is an useful instrument to describe criteria to select medicines- in a statistical way. Unfortunately doctors don’t think in numbers of LRs.(12) Still they can choose correct medicines. The question is now: how do we clear the relation between successful homeopathy and our chosen research-instrument, the LR?

Ideally, we would put after each medicine in each rubric of the repertory the LR, obtained from prospective research. We would expect the LR of the most homeopathic symptoms to range from 1 to 30. But in our experience it is not very easy to handle these numbers in our diagnostic thinking. Doctors usually think proportional, in categories of probability and not in a linear way. A diagnosis or the choice for a specific medicine is very improbable, possible, probable or very probable. Arguments such as the result of a test, the presence or absence of a symptom, are weak, moderate or strong arguments for or against a specific diagnosis or medicine. What we have to do is to correlate LRs with these weights of arguments. What LR ranges constitute weak, moderate or strong arguments? In clinical experience the difference between LR =1 and LR = 10 is greater than the difference between 10 and 20. A linear scale does not accurately reflect clinical importance. A natural scale in information theories is a logarithmic scale. For the homeopathic practice we propose that a 2-logarithm be used.

The logarithmic representation of LR looks as follows:

LR

log2 LR

1

0

2

1

4

2

8

3

16

4

32

5

As we supposed we expect the LR of most homeopathic symptoms to range from 1 to 30. On a logarithmic scale this is 0-5. The results of our prospective research will give us the possibility to put the log2-LR behind the medicines in the repertory. At this moment it is difficult to translate log2-LR’s in weights of arguments. But, in theory it is possible to quantitate the meaning of typefaces by weak (normal), moderate (italic) and strong (bold character) arguments. The range for weak arguments can be log2 LR 0-1, for good arguments 2-3 and for strong arguments 4 and higher. In the rubric MIND-ANGUISH the notation will be then "anac.(3)" or "ant-t.(1)" or "Bell.(5)",assuming that prospective investigation confirms the same importance of the symptom as the repertory.

Here we have estimated the LR based on intuitive assumptions, more or less based on practice experience. This is the way things have been done for over more than two centuries. We can do better by performing prospective studies (see ‘Towards research’).

Repertorisation is the process of putting different symptom-rubrics alongside each other in a spreadsheet. At present a blank in the repertorisation stands for ‘the medicine not present in that rubric’. This suggests that the symptom does not occur in relation to that medicine. This is inaccurate; a blank should stand for a likelihoodratio close to 1 indicating that the occurrence of the symptom with that medicine is the same as the occurrence with the rest of the (waiting room) population.

Towards research

If we want to perform this by prospective research on LR we will have to evaluate an enormous amount of symptoms. This requires a method that interferes with daily practice as little as possible.

Our first goal is to investigate the possibility of assessing the LR of homeopathic symptoms. Our long-term goal is to update our materia medica and repertory by means of statistical instruments that match the homeopathic methodology.

We will conduct this investigation for ourselves, it will not convince anybody that homeopathy works. We want to assess the best symptoms to include or exclude a certain medicine. The presence and the intensity of many symptoms are judged by clinical experience. We rely on this experience and do not want to standardise each item in the investigation more than necessary. If we want to convince others with this kind of research we risk discouraging patients and practitioners from participating.

Our main concern is the relevance of the likelihoodratio for practical use. We could, say, use elaborate and validated personality tests to assess if a person is jealous and base the likelihoodratio on the outcomes of this test. But when the practitioner uses his clinical expertise (and not the same personality test) during consultations he gets another kind of ‘jealousy’. The instrument (symptom) we investigate to determine the LR must be the same as the instrument used in practice. But it is clear that we have to define our symptoms as accurately as possible, i.e. better than in the current materia medica and repertory for many symptoms.

The gold standard in convincing others (high internal validity) is the randomised controlled clinical Trial (RCT). In such research interference with daily practice is significant. But we are already convinced of the importance of symptoms. We just want to know which symptoms are more important than others. The less interference with daily practice, the more co-operation we get.

Analysis

Which are the constituents of the standard homeopathic consultation that can be used to determine the LR? Homeopathic physicians develop the habit of asking a series of standard questions to get an overview of the patient’s health status and his ‘constitution’. This list of questions (but also observations) depends on personal experience but, generally speaking, there is consensus about the kind of questions that everyone asks during the first consultation.

For our first goal, the usefulness of LR, we can start with symptoms that most physicians regard as basic for the first consultation. If we research the likelihood ratio of a homeopathic symptom prospectively and with more participating physicians everyone should ask every patient included in the investigation the same question. If we take the symptom ‘Desire for coffee’ related to the medicine Angustura as an example the investigation could proceed as follows:

  1. We define the symptom ‘Desire for coffee’, e.g. more than 10 cups a day.
  2. We ask every patient: "How many cups of coffee do you drink each day?".
  3. The answer to this question (interpreted by the physician) and the corresponding patient are coded and stored in a central place.
  4. After a certain number of patients have been treated for a certain period the results of all the treatments are evaluated.
  5. Now we can calculate the prevalence of ‘Desire for coffee’ in the group that responded to Angustura and the prevalence of the symptom in the rest of the population.
  6. This results in the LR.

There is no placebo-control, but a comparison between two groups, ‘Angustura’ and ‘not-Angustura’. For the patient there is no perceptible difference in the treatment whether he is part of this investigation or not. For the doctor there is some extra work in gathering the data.

What does the population we investigate look like? We start from a symptom that we investigate in a number of patients. But of course we can take several symptoms at a time. Now we have to define the groups in the 2x2 diagram like we showed in the paragraph ‘Likelihood ratio and homeopathy’.

Here we have to answer the next questions:

  1. What is a positive effect?
  2. What about the patients that got more than one medicine?

These problems are still to be resolved. Then there is the size of the population we have to investigate. What can we expect? There are different classes of symptoms like:

A. Rare symptoms confined to one medicine

B. Frequently occurring symptoms belonging to a moderate number of medicines

C. Frequently occurring symptoms occurring to many medicines

And, of course, all possible varieties. An example of group A. could be ‘Nausea from the sound of an organ’ (Physostigma). If we want to investigate this symptom we need to include many patients if we want to see this symptom at all. The ‘B-class symptom’ ‘Nausea from odours’ is not that rare: probably frequently occurring with Colchicum, but perhaps rare in relation to the medicine Eupatorium-perfoliatum. The ‘C-class’ symptom ‘Nausea during headache’ occurs frequently, but in varying degrees with different medicines. If our population is small we can only investigate frequently occurring symptoms related to medicines that are frequently associated with that symptom.

We need to include many patients in such an investigation, especially if we want to assess the LR of rare symptoms. It will be necessary to conduct the research at several centres. Our main concern is a design that causes the least possible interference with daily practice. Enthusiasm will be needed to perform the same investigation over and over again for different sets of symptoms. The doctors participating need to be involved in the implementation of the project, and standardisation must be assured. International agreement and co-operation is needed to bring such a project to fruition. We invite all colleagues who are interested to contact us.

Discussion

The transition from the homeopathic notion of ‘important symptom’ to the modern epidemiological idea of LR has great potential. The task is enormous, in return we get more reliable instruments. Symptoms can be seen as diagnostic instruments and the LR as an indication for optimal use. In future, computer programs may show you the symptoms with the highest LR(+) to confirm that medicine, or the lowest LR(-) to exclude it. This may be a solution to shortcomings of the repertory, e.g. handling rare medicines. Furthermore the new data will be based on empirical evidence instead of estimation.

We think that we can and must develop a scientific identity of our own to improve our method instead of proving it, . But one question provokes the next one, e.g.: to research LR properly we need better descriptions of symptoms and better definition of our ‘gold standard’, the cure. Can we develop an organisation and sufficient communication to establish this new scientific identity?

Conclusions

  1. The likelihoodratio(+) can be used as a mathematical representation of §153 of the Organon. That means that symptoms belonging to medicines are accessible for statistical research. This offers the possibility of giving a more accurate and quantitative description of a symptom, based on empirical data.
  2. Little known medicines have few symptoms in the repertory. Probably these symptoms are undervalued. The use of likelihoodratio can correct misrepresentation.
  3. It is likely that plain-type entries of a medicine with many symptoms in a large rubric stand for a likelihood ratio =<1. This means that these symptoms cannot be used to include the medicine in our considerations.
  4. In future typefaces can indicate the weight of the argument of the symptom in relation with a medicine.
  5. We propose a research project on likelihood ratios. International agreement and co-operation is needed.

 

Literature:

1. Sackett DL, Haynes RB, Guyatt GH, Tugwell P. Clinical epidemiology: a basic science for clinical medicine. 2nd ed. Boston: Little, Brown, 1991
2. Vermeulen M. Dwalingen in de methodologie. XXXVI. Van 'likelihood'-ratio's en de regel van Bayes. NTvG 2001/50: 2421-4
3. Horton MD, Counter SF, Florence MG, Hart MJ. A prospective trial of computed tomography and ultrasonography for diagnosing appendicitis in the atypical patient. Am J Surg 2000;179(5):379-81
4. Ende J van den, Derese A, e.a. Medische besliskunde. Huisarts Nu 1996;25(9):283-344
5. Elstein AE, Schwarz A. Clinical problem solving and diagnostic decision making: selective review of the cognitive literature. BMJ 2002;324:729-32
6. Stolper CF, Kipp R, Lugten RFG. Consensus 'Casus geslaagd of niet?'. Similia Similibus Curentur. 1997;27(1):21-2
7. Stolper CF,Lugten RFG, Rutten ALB. Materia Medica Validering: Lachesis en Tarentula. Similia Similibus Curentur2000;30(4)18-9
8. Kent JT. Repertory of the Homeopathic Materia Medica. 6th edition, World Homeopathic Links, New Delhi1982
9. Hering C. Guiding Symptoms of our Materia Medica. B. Jain Publishers 1974
10. Clarke JH. A dictionairy of practical materia medica. Jain Publishing, New Delhi, 1984
11. Swayne J. Homeopathic method. Implications for clinical practice and medical science. Chapter 6.Churchill Livingstone 1998
12. Rutten ALB, Stolper CF, Lugten RFG, Barthels RJWM. Waar, niet waar of waarschijnlijk? Een eigen wetenschappelijke identiteit voor de homeopathie. Similia Similibus Curentur 2001;31(1):3-30
13. Fisher P. Prove it or improve it? Br. Hom. J. 1995;84:129-30

 

Lex Rutten, MD

Aard 10 - 4813 NN Breda, Netherlands