Clinical Trial, Phase II
Journal Article
Randomized Controlled Trial
Research Support, Non-U.S. Gov't
Add like
Add dislike
Add to saved papers

Suboptimal reliability of liver biopsy evaluation has implications for randomized clinical trials.

Journal of Hepatology 2020 December
BACKGROUND & AIMS: Liver biopsies are a critical component of pivotal studies in non-alcoholic steatohepatitis (NASH), constituting inclusion criteria, risk stratification factors and endpoints. We evaluated the reliability of NASH Clinical Research Network scoring of liver biopsies in a NASH clinical trial.

METHODS: Digitized slides of 678 biopsies from 339 patients with paired biopsies randomized into the EMMINENCE study - examining a novel insulin sensitizer (MSDC-0602K) in NASH - were read independently by 3 hepatopathologists blinded to treatment code and scored using the NASH CRN histological scoring system. Various endpoints were computed from these scores.

RESULTS: Inter-reader linearly weighted kappas were 0.609, 0.484, 0.328, and 0.517 for steatosis, fibrosis, lobular inflammation, and ballooning, respectively. Inter-reader unweighted kappas were 0.400 for the diagnosis of NASH, 0.396 for NASH resolution without worsening fibrosis, and 0.366 for fibrosis improvement without worsening NASH. In the current study, 46.3% of the patients included in the study based on 1 hepatopathologist's qualifying reading were deemed not to meet the study's histologic inclusion criteria by at least 1 of the 3 hepatopathologists. The MSDC-0602K treatment effect was lowest for those histologic features with lower inter-reader reliability. Simulations show that the lack of reliability of endpoints and inclusion criteria can drastically reduce study power - from >90% in a well-powered study to as low as 40%.

CONCLUSIONS: The reliability of hepatopathologists' liver biopsy evaluation using currently accepted criteria is suboptimal. This lack of reliability may affect NASH pivotal studies by introducing patients who do not meet NASH study entry criteria, misclassifying fibrosis subgroups, and attenuating apparent treatment effects.

LAY SUMMARY: Since liver biopsy analysis plays such an important role in clinical studies of non-alcoholic steatohepatitis, it is important to understand the reliability of hepato-pathologist readings. We examined both inter- and intra-reader variability in a large data set of paired liver biopsies from a clinical trial. We found very poor inter-reader and modest intra-reader variability. This result has important implications for entry criteria, fibrosis stratification, and the ability to measure a treatment effect in clinical trials.

Full text links

We have located links that may give you full text access.
Can't access the paper?
Try logging in through your university/institutional subscription. For a smoother one-click institutional access experience, please use our mobile app.

Related Resources

For the best experience, use the Read mobile app

Mobile app image

Get seemless 1-tap access through your institution/university

For the best experience, use the Read mobile app

All material on this website is protected by copyright, Copyright © 1994-2024 by WebMD LLC.
This website also contains material copyrighted by 3rd parties.

By using this service, you agree to our terms of use and privacy policy.

Your Privacy Choices Toggle icon

You can now claim free CME credits for this literature searchClaim now

Get seemless 1-tap access through your institution/university

For the best experience, use the Read mobile app