Researchers left at crossroads after doubts cast on scientific probability test

It’s amazing what scientists have found out about human behaviour. They have shown that watching a heart-warming film clip can make us more patient, while how we feel about relatives can be influenced by where we put dots on graph paper.

Such unusual insights have long been a staple of media coverage of science – the “amazing but true” tales we all know and love.

Whether we believe them is, however, another matter. Certainly doubts about the reliability of such claims have long circulated among academics.

Now these doubts are fuelling a controversy about the future of the US$1.5 trillion (Dh5.5tn) global research enterprise.

It centres on the reliability of the techniques routinely used to decide if a finding is worth taking seriously.

This month, the American Statistical Association (ASA) sent shockwaves through the research community by voicing concern that “misunderstanding or misuse” of these techniques is leading to claims that too often fail to stand up.

The unprecedented public statement follows the failure of attempts to replicate findings published in research journals – among them those claims about heart-warming film clips and dots on graph paper.

Such failures are causing “much confusion and even doubt about the validity of science”, according to the ASA, which is now calling for “renewed and vigorous attention to changing the practice of science”.

It is hard to overstate the implications of the ASA’s statement. Replication is the acid test of science, with a track record of weeding out faulty, flawed or fraudulent claims.

The ASA is now raising concerns about the reliability of techniques playing a key part in that process.

Taught to generations of researchers, so-called significance testing is supposed to cast light on the likely success of replication. At its core is a figure known as the p-value, which is worked out from raw data.

This measures the chances of getting at least as impressive an outcome as that seen, assuming it’s really just a fluke.

By convention, if the p-value comes out at less than 1 in 20, the outcome is deemed “statistically significant”, on the grounds that it’s unlikely to be a fluke.

Of course, a study result can be misleading for many other reasons, from dodgy design to faulty equipment. But at least mere chance has been ruled out with 95 per cent reliability.

Except it has not – and believing otherwise is precisely the misunderstanding that the ASA is worried about.

The belief that p-values measure the chances of a result being just a fluke is alarmingly widespread, and even pops up in many textbooks. But as they are calculated on the assumption that fluke is the true cause, p-values clearly cannot also be used to test if that assumption is true.

Doing so is akin to assuming a rule is accurate, and then claiming to show it by measuring the distance between two points using the same rule.

The good news is that it is possible to convert p-values into the chances of a finding being a fluke. The bad news is that most researchers do not know how to do it. Instead, they simply flip p-values below one in 20 around to convince themselves the chances of their result being real are thus 19 in 20.

The dangers of such faulty reasoning can no longer be dismissed as academic.

Last year, the journal Science published the outcome of an international effort to replicate 100 studies published in three psychology journals.

Virtually all the original studies had passed the classic test of “statistical significance”, with p-values below one in 20. Yet barely one in three of the studies were successfully replicated, and even those that were typically produced much less impressive outcomes than the original studies.

This month, Science published the outcome of a similar effort at replication, this time of studies published in two leading economics journals. The results were more encouraging – even so, about 40 per cent of the original findings failed to replicate and again, the findings were typically far less impressive than in the original studies.

As the ASA points out, none of this should come as a surprise. Statisticians have been warning researchers about the dangers of misinterpreting p-values for decades. Even the Cambridge mathematician who invented significance tests in the 1920s knew the scope for misunderstanding.

Shortly after including them in his hugely influential textbook Statistical Methods for Research Workers, Ronald Fisher advised researchers to use p-values only as a test of what to ignore, rather than what to take seriously.

Yet by the 1950s, Fisher’s tests had become a part of every scientist’s toolkit for making discoveries, while their “terms and conditions” were ignored.

In its statement, the ASA highlights other abuses of p-values. These include “data dredging”, where researchers hunt for anything giving p-values below one in 20 – and thus “statistical significant” findings they can publish. Such practices haunt the burgeoning field of Big Data, relied on by businesses to extract insight from their data sets.

The ASA wants to see researchers move away from the simplistic pass-fail mentality encouraged by p-values, towards more sophisticated alternatives. That is, however, more easily said than done.

But there is a more formidable barrier to change: the profession of science itself. Anyone wanting a career in research must publish in journals, which in turn prefer eye-catching advances over damp squibs. Until now, both these agendas have benefited from the low bar for “significance” set by p-values. Moving to alternatives as suggested by the ASA is likely to set that bar higher.

It’s no exaggeration to say that the scientific enterprise stands at a crossroads. Will researchers opt to give themselves a much harder time, or will they continue to fool themselves and us with “discoveries” that are neither amazing nor true?

Robert Matthews is visiting professor of Science at Aston University in Birmingham, England. His new book Chancing It: The Laws of Chance and What they Mean for you is out now.

TYPES%20OF%20ONLINE%20GIG%20WORK

%3Cp%3E%3Cstrong%3EDesign%2C%20multimedia%20and%20creative%20work%3A%20%3C%2Fstrong%3ELogo%20design%2C%20website%20design%2C%20visualisations%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EBusiness%20and%20professional%20management%3A%20%3C%2Fstrong%3ELegal%20or%20management%20consulting%2C%20architecture%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EBusiness%20and%20professional%20support%3A%20%3C%2Fstrong%3EResearch%20support%2C%20proofreading%2C%20bookkeeping%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3ESales%20and%20marketing%20support%3A%20%3C%2Fstrong%3ESearch%20engine%20optimisation%2C%20social%20media%20marketing%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EData%20entry%2C%20administrative%2C%20and%20clerical%3A%20%3C%2Fstrong%3EData%20entry%20tasks%2C%20virtual%20assistants%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EIT%2C%20software%20development%20and%20tech%3A%20%3C%2Fstrong%3EData%20analyst%2C%20back-end%20or%20front-end%20developers%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EWriting%20and%20translation%3A%20%3C%2Fstrong%3EContent%20writing%2C%20ghost%20writing%2C%20translation%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EOnline%20microtasks%3A%20%3C%2Fstrong%3EImage%20tagging%2C%20surveys%3C%2Fp%3E%0A%3Cp%3E%3Cem%3ESource%3A%20World%20Bank%3C%2Fem%3E%3C%2Fp%3E%0A

At a glance

Fixtures All matches start at 9.30am, at ICC Academy, Dubai. Admission is free

Thursday UAE v Ireland; Saturday UAE v Ireland; Jan 21 UAE v Scotland; Jan 23 UAE v Scotland

UAE squad Rohan Mustafa (c), Ashfaq Ahmed, Ghulam Shabber, Rameez Shahzad, Mohammed Boota, Mohammed Usman, Adnan Mufti, Shaiman Anwar, Ahmed Raza, Imran Haider, Qadeer Ahmed, Mohammed Naveed, Amir Hayat, Zahoor Khan

Mercer, the investment consulting arm of US services company Marsh & McLennan, expects its wealth division to at least double its assets under management (AUM) in the Middle East as wealth in the region continues to grow despite economic headwinds, a company official said.

Mercer Wealth, which globally has $160 billion in AUM, plans to boost its AUM in the region to $2-$3bn in the next 2-3 years from the present $1bn, said Yasir AbuShaban, a Dubai-based principal with Mercer Wealth.

“Within the next two to three years, we are looking at reaching $2 to $3 billion as a conservative estimate and we do see an opportunity to do so,” said Mr AbuShaban.

Mercer does not directly make investments, but allocates clients’ money they have discretion to, to professional asset managers. They also provide advice to clients.

“We have buying power. We can negotiate on their (client’s) behalf with asset managers to provide them lower fees than they otherwise would have to get on their own,” he added.

Mercer Wealth’s clients include sovereign wealth funds, family offices, and insurance companies among others.

From its office in Dubai, Mercer also looks after Africa, India and Turkey, where they also see opportunity for growth.

Wealth creation in Middle East and Africa (MEA) grew 8.5 per cent to $8.1 trillion last year from $7.5tn in 2015, higher than last year’s global average of 6 per cent and the second-highest growth in a region after Asia-Pacific which grew 9.9 per cent, according to consultancy Boston Consulting Group (BCG). In the region, where wealth grew just 1.9 per cent in 2015 compared with 2014, a pickup in oil prices has helped in wealth generation.

BCG is forecasting MEA wealth will rise to $12tn by 2021, growing at an annual average of 8 per cent.

Drivers of wealth generation in the region will be split evenly between new wealth creation and growth of performance of existing assets, according to BCG.

Another general trend in the region is clients’ looking for a comprehensive approach to investing, according to Mr AbuShaban.

“Institutional investors or some of the families are seeing a slowdown in the available capital they have to invest and in that sense they are looking at optimizing the way they manage their portfolios and making sure they are not investing haphazardly and different parts of their investment are working together,” said Mr AbuShaban.

Some clients also have a higher appetite for risk, given the low interest-rate environment that does not provide enough yield for some institutional investors. These clients are keen to invest in illiquid assets, such as private equity and infrastructure.

“What we have seen is a desire for higher returns in what has been a low-return environment specifically in various fixed income or bonds,” he said.

“In this environment, we have seen a de facto increase in the risk that clients are taking in things like illiquid investments, private equity investments, infrastructure and private debt, those kind of investments were higher illiquidity results in incrementally higher returns.”

The Abu Dhabi Investment Authority, one of the largest sovereign wealth funds, said in its 2016 report that has gradually increased its exposure in direct private equity and private credit transactions, mainly in Asian markets and especially in China and India. The authority’s private equity department focused on structured equities owing to “their defensive characteristics.”

Researchers left at crossroads after doubts cast on scientific probability test

The American Statistical Association has sent shockwaves through the research community by voicing concern that “misunderstanding or misuse” of techniques used to identify the probability of findings being repeated is leading to claims that too often fail to stack up.

TYPES%20OF%20ONLINE%20GIG%20WORK

At a glance