Artificial intelligence for Clinical Pathology: Data-efficient foundation model for biomarker detection
Bremen, Germany (02 September 2024) – The use of Artificial Intelligence (AI) systems shows promise in medicine, where they can be used to detect diseases earlier, improve treatments, and ease staff workloads. But their performance depends on how well the AI is trained. A new multi-task approach to training AI makes it possible to train foundation models more quickly and cost-effectively, with less data. Researchers are turning to this approach to compensate for the shortage of data in medical imaging, and ultimately to save lives.
According to the World Health Organization (WHO), there has been a significant increase in cases of cancer worldwide. Clear indicators, known as biomarkers, are key to reliable diagnosis and successful treatment. AI systems can help identify these kinds of measurable parameters in pathological images. Researchers from the Fraunhofer Institute for Digital Medicine MEVIS teamed up with RWTH Aachen University, the University of Regensburg, and Hannover Medical School to develop a foundation model for this. The resource-efficient model analyses tissue samples quickly and reliably, based on just a fraction of the usual training data.
Moving away from large volumes of data and self-supervised learning
Standard foundation models, like the large language models used for ChatGPT, are trained using large and diverse data sets, supervising themselves as they learn. But for medical image analysis, data is generally scarce, and in fact, the small amounts of data available in clinical studies pose a major challenge for the use of AI. In addition, clinical centres differ in how they process pathological preparations and in their patient populations — even before the specific form and characteristics of diseases are considered.
All these factors make it harder to reliably detect existing patterns, and thus diagnostically relevant characteristics. As a result, effective AI training typically requires large volumes of training images from different origins. But each cross-sectional image of tissue is several gigabytes in size, and although it contains thousands of different cells, it reflects only a tiny fraction of the variability present.
Specialization follows solid foundational training
Fraunhofer MEVIS has devised a solution based on supervised pre-training. “We’re developing a training strategy for foundational AI modelled on the training that pathologists undergo. They don’t have to relearn what a nucleus is all over again in each case. That’s textbook knowledge. Once these concepts have been covered, they’re present as a foundation and can be applied to various diseases,” explains Dr Johannes Lotz, an expert from Fraunhofer MEVIS.
In much the same way, their AI model undergoes foundational training, learning general characteristics and laws, known as tissue concepts, from a broad collection of tissue section images created for various tasks. Combining these tasks gives rise to the large volumes of data needed to train a robust large AI model. The learned tissue concepts are then applied to a specific task in a second step. In this way, the algorithms can identify biomarkers that distinguish different types of tumours, for example, all with much less data.
“In our solution, every data set has been annotated by a specially trained human with the information that needs to be learned,” explains Jan Raphael Schäfer, an AI expert at Fraunhofer MEVIS who works in Lotz’s team. “We give our model the image and provide the answer at the same time. And we do it for numerous different tasks simultaneously, using a multi-task approach.”
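The two-step idea described above (supervised multi-task pre-training, then specialization with little data) can be illustrated with a deliberately tiny sketch. It is not the Fraunhofer MEVIS model: the two-number "images", the linear encoder, the scalar task heads, and the task names `task_a`, `task_b` and `task_c` are all hypothetical stand-ins chosen so that the example runs with the Python standard library alone.

```python
# Toy illustration only: each "image" is two numbers, the shared encoder is
# linear, and every task has a single scalar head. All names and data here
# are hypothetical.

def encode(w, x):
    """Shared encoder: maps an input to one feature used by every task."""
    return sum(wi * xi for wi, xi in zip(w, x))

def mean_loss(w, heads, data):
    """Mean squared error over the labelled samples of all tasks combined."""
    return sum((heads[t] * encode(w, x) - y) ** 2 for t, x, y in data) / len(data)

def train(w, heads, data, lr=0.05, steps=2000, train_encoder=True):
    """Full-batch gradient descent on the combined loss of all tasks."""
    w, heads = list(w), dict(heads)
    n = len(data)
    for _ in range(steps):
        grad_w = [0.0] * len(w)
        grad_heads = {t: 0.0 for t in heads}
        for t, x, y in data:
            h = encode(w, x)
            err = heads[t] * h - y          # prediction minus provided answer
            grad_heads[t] += 2 * err * h / n
            for i, xi in enumerate(x):
                grad_w[i] += 2 * err * heads[t] * xi / n
        for t in heads:
            heads[t] -= lr * grad_heads[t]
        if train_encoder:                   # frozen during specialization
            w = [wi - lr * gi for wi, gi in zip(w, grad_w)]
    return w, heads

# Step 1: pre-train on two annotated tasks at once. Both tasks depend on the
# same underlying feature (x0 - x1), which the shared encoder has to learn.
inputs = [(1.0, 0.0), (0.0, 1.0), (1.0, 1.0), (2.0, 1.0)]
pretraining_data = (
    [("task_a", x, 2.0 * (x[0] - x[1])) for x in inputs]
    + [("task_b", x, -1.0 * (x[0] - x[1])) for x in inputs]
)
w0 = [0.2, 0.1]
heads0 = {"task_a": 0.1, "task_b": 0.1}
initial_loss = mean_loss(w0, heads0, pretraining_data)
w, heads = train(w0, heads0, pretraining_data)
final_loss = mean_loss(w, heads, pretraining_data)

# Step 2: specialize on a new task with only two labelled samples, keeping
# the pre-trained encoder frozen and fitting just a new head.
new_task_data = [("task_c", (3.0, 1.0), 1.0), ("task_c", (1.0, 2.0), -0.5)]
new_head0 = {"task_c": 0.1}
loss_before = mean_loss(w, new_head0, new_task_data)
_, new_head = train(w, new_head0, new_task_data, steps=500, train_encoder=False)
loss_after = mean_loss(w, new_head, new_task_data)

print("pre-training loss:", initial_loss, "->", final_loss)
print("new-task loss:", loss_before, "->", loss_after)
```

Because every task's error flows back into the same encoder, each annotated data set contributes to the shared representation, and the second step only has to fit a small task-specific part.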
The team also uses an image registration method developed at the institute: HistokatFusion. This method makes it possible to generate automatically annotated training data from tissue studies such as immunohistochemical staining, which uses labelled antibodies to visualize proteins or other structures. To do this, it combines information from multiple histopathological images. The experts incorporate these automatically generated annotations into the training of their model, which accelerates data collection.
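As a rough illustration of how registration can turn one image's information into annotations for another, the sketch below maps annotation outlines from one slide's coordinate system into another's using a known transform. It does not use or reproduce HistokatFusion: the function names, the annotation format, and the choice of a simple affine transform are assumptions made for this example, and registration of real tissue sections typically also has to estimate the transform and handle non-rigid deformation.

```python
import math

def make_affine(angle_deg, scale, tx, ty):
    """Build a 2x3 affine matrix: rotation and uniform scaling, then a shift."""
    a = math.radians(angle_deg)
    c, s = math.cos(a) * scale, math.sin(a) * scale
    return ((c, -s, tx), (s, c, ty))

def apply_affine(m, point):
    """Map one (x, y) point through the affine matrix m."""
    x, y = point
    return (m[0][0] * x + m[0][1] * y + m[0][2],
            m[1][0] * x + m[1][1] * y + m[1][2])

def transfer_annotations(m, annotations):
    """Copy labelled polygons from the source image into the target image."""
    return [
        {"label": a["label"], "polygon": [apply_affine(m, p) for p in a["polygon"]]}
        for a in annotations
    ]

# Hypothetical example: a region that stained positive on one section is
# mapped onto a neighbouring section that is rotated by 90 degrees and
# shifted by 10 pixels in x. In practice the transform would be the output
# of the registration step rather than a hand-written constant.
stain_annotations = [
    {"label": "marker_positive", "polygon": [(1.0, 0.0), (2.0, 0.0), (2.0, 1.0)]}
]
source_to_target = make_affine(angle_deg=90.0, scale=1.0, tx=10.0, ty=0.0)
transferred = transfer_annotations(source_to_target, stain_annotations)
print(transferred)
```

The transferred polygons can then serve as training labels for the target image without a person having to draw them again.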
Outstanding results with just 6% of the resources
Compared to models that do not involve supervised training, the Fraunhofer researchers’ approach achieves similar results with only six percent of the training data. “Since the amount of training data in deep learning correlates with training effort and processing power, we found that we needed about six percent of the resources typically required. Furthermore, we only need about 160 hours of training, which is a crucial cost factor. This means we can train an equivalent model with much less effort,” Lotz explains.
The Fraunhofer experts’ participation in the international SemiCOL (Semi-supervised learning for colorectal cancer detection) competition for cancer classification and segmentation showed how well these pre-trained models generalize. The team won the classification part of the challenge without having to undertake expensive adjustments to their model and ultimately came in second out of nine participating teams.
Tests of interactive image segmentation, in which tissue structures are automatically detected and measured in an image, also show that this method has great potential. The model needs only a few sample image sections to extend concepts that it has already learned. But that isn’t all. “Models based on our solution make it possible to develop new interactive medical AI training tools that let specialists interact directly with AI solutions and train relevant models quickly, even without any technical background knowledge,” says Schäfer.
Freely accessible and transferable
The researchers publish the pre-trained model and the code for further training on various platforms. This lets specialists use it for non-commercial purposes to develop their own solutions. The team is also working with clinical partners to systematically validate the solution and have it approved for medical applications. The experts at Fraunhofer MEVIS are certain that, once in use in day-to-day clinical practice, systems involving their foundation model will reduce workloads in pathology and improve the success of treatment.