Competing for Pixels: A Self-Play Algorithm for Weakly-Supervised Semantic Segmentation

Shaheer U. Saeed, Shiqi Huang, João Ramalhinho, Iani J.M.B. Gayo, Nina Montaña-Brown, Ester Bonmati, Stephen P. Pereira, Brian Davidson, Dean C. Barratt, Matthew J. Clarkson and Yipeng Hu 2025. Competing for Pixels: A Self-Play Algorithm for Weakly-Supervised Semantic Segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence. 47 (2), pp. 825-839. https://doi.org/10.1109/TPAMI.2024.3474094

TitleCompeting for Pixels: A Self-Play Algorithm for Weakly-Supervised Semantic Segmentation
TypeJournal article
AuthorsShaheer U. Saeed, Shiqi Huang, João Ramalhinho, Iani J.M.B. Gayo, Nina Montaña-Brown, Ester Bonmati, Stephen P. Pereira, Brian Davidson, Dean C. Barratt, Matthew J. Clarkson and Yipeng Hu
Abstract

Weakly-supervised semantic segmentation (WSSS) methods, reliant on image-level labels indicating object presence, lack explicit correspondence between labels and regions of interest (ROIs), posing a significant challenge. Despite this, WSSS methods have attracted attention due to their much lower annotation costs compared to fully-supervised segmentation. Leveraging reinforcement learning (RL) self-play, we propose a novel WSSS method that gamifies image segmentation of a ROI. We formulate segmentation as a competition between two agents that compete to select ROI-containing patches until exhaustion of all such patches. The score at each time-step, used to compute the reward for agent training, represents likelihood of object presence within the selection, determined by an object presence detector pre-trained using only image-level binary classification labels of object presence. Additionally, we propose a game termination condition that can be called by either side upon exhaustion of all ROI-containing patches, followed by the selection of a final patch from each. Upon termination, the agent is incentivised if ROI-containing patches are exhausted or disincentivised if a ROI-containing patch is found by the competitor. This competitive setup ensures minimisation of over- or under-segmentation, a common problem with WSSS methods. Extensive experimentation across four datasets demonstrates significant performance improvements over recent state-of-the-art methods.

Keywordsself-play, weak supervision, segmentation
JournalIEEE Transactions on Pattern Analysis and Machine Intelligence
Journal citation47 (2), pp. 825-839
ISSN1939-3539
2160-9292
0162-8828
Year2025
PublisherIEEE
Accepted author manuscript
License
CC BY 4.0
File Access Level
Open (open metadata and files)
Digital Object Identifier (DOI)https://doi.org/10.1109/TPAMI.2024.3474094
Publication dates
Published03 Oct 2024
Published in printFeb 2025

Related outputs

Enhanced CATBraTS for Brain Tumour Semantic Segmentation
El Badaoui, R., Bonmati Coll, E., Psarrou, A., Asaturyan, H. and Villarini, B. 2025. Enhanced CATBraTS for Brain Tumour Semantic Segmentation. Journal of Imaging. 11 (1), p. 8. https://doi.org/10.3390/jimaging11010008

Active learning using adaptable task-based prioritisation
Shaheer U. Saeed, João Ramalhinho, Mark Pinnock, Ziyi Shen, Yunguan Fu, Nina Montaña-Brown, Ester Bonmati, Dean C. Barratt, Stephen P. Pereira, Brian Davidson, Matthew J. Clarkson and Yipeng Hu 2024. Active learning using adaptable task-based prioritisation. Medical Image Analysis. 95 103181. https://doi.org/10.1016/j.media.2024.103181

Fan-Slicer: A Pycuda Package for Fast Reslicing of Ultrasound Shaped Planes
João Ramalhinho, Thomas Dowrick, Bonmati Coll, E. and Matthew J. Clarkson 2023. Fan-Slicer: A Pycuda Package for Fast Reslicing of Ultrasound Shaped Planes. Journal of Open Research Software. 11 (1), p. 3. https://doi.org/10.5334/jors.422

Voice-Assisted Image Labeling for Endoscopic Ultrasound Classification Using Neural Networks
Bonmati Coll, E., Hu, Y., Grimwood, A., Johnson, G.J., Goodchild, G., Keane, M.G., Gurusamy, K., Davidson, B., Clarkson, M.J., Pereira, S.P. and Barratt, D.C. 2022. Voice-Assisted Image Labeling for Endoscopic Ultrasound Classification Using Neural Networks. IEEE Transactions on Medical Imaging. 41 (6), pp. 1311-1319. https://doi.org/10.1109/tmi.2021.3139023

Assisted Probe Positioning for Ultrasound Guided Radiotherapy Using Image Sequence Classification
Grimwood, A., McNair, H., Hu, Y., Bonmati Coll, E., Barratt, D. and Harris, E.J. 2020. Assisted Probe Positioning for Ultrasound Guided Radiotherapy Using Image Sequence Classification. Medical Image Computing and Computer Assisted Intervention – MICCAI 2020, 23rd International Conference. Lima, Peru 04 - 08 Oct 2020 Springer. https://doi.org/10.1007/978-3-030-59716-0_52

Novel Brain Complexity Measures Based on Information Theory
Bonmati Coll, E., Bardera, A., Feixas, M. and Boada, I. 2018. Novel Brain Complexity Measures Based on Information Theory. Entropy. 20 (7) 491. https://doi.org/10.3390/e20070491

Determination of optimal ultrasound planes for the initialisation of image registration during endoscopic ultrasound-guided procedures
Bonmati Coll, E., Hu, Y., Gibson, E., Uribarri, L., Keane, G., Gurusami, K., Davidson, B., Pereira, S.P., Clarkson, M.J. and Barratt, D.C. 2018. Determination of optimal ultrasound planes for the initialisation of image registration during endoscopic ultrasound-guided procedures. International Journal of Computer Assisted Radiology and Surgery. 13, pp. 875-883. https://doi.org/10.1007/s11548-018-1762-2

Automatic Multi-Organ Segmentation on Abdominal CT with Dense V-Networks
Gibson, E., Giganti, F., Hu, Y., Bonmati Coll, E., Bandula, S., Gurusamy, K., Davidson, B., Pereira, S.P., Clarkson, M.J. and Barratt, D.C. 2018. Automatic Multi-Organ Segmentation on Abdominal CT with Dense V-Networks. IEEE Transactions on Medical Imaging. 37 (8), pp. 1822-1834. https://doi.org/10.1109/tmi.2018.2806309

Automatic segmentation method of pelvic floor levator hiatus in ultrasound using a self-normalizing neural network
Bonmati Coll, E., Hu, Y., Sindhwani, N., Dietz, H.P., D'hooge, J., Barratt, D., Deprest, J. and Vercauteren, T. 2018. Automatic segmentation method of pelvic floor levator hiatus in ultrasound using a self-normalizing neural network. Journal of Medical Imaging. 5 (2) 021206. https://doi.org/10.1117/1.jmi.5.2.021206

Brain parcellation based on information theory
Bonmati Coll, E., Bardera, A. and Boada, I. 2017. Brain parcellation based on information theory. Computer Methods and Programs in Biomedicine. 151, pp. 203-212. https://doi.org/10.1016/j.cmpb.2017.07.012

Towards image-guided pancreas and biliary endoscopy: Automatic multi-organ segmentation on abdominal CT with dense dilated networks
Gibson, E., Giganti, F., Hu, Y., Bonmati Coll, E., Bandula, S., Gurusamy, K., Davidson, B.R., Pereira, S.P., Clarkson, M.J. and Barratt, D.C. 2017. Towards image-guided pancreas and biliary endoscopy: Automatic multi-organ segmentation on abdominal CT with dense dilated networks. Medical Image Computing and Computer Assisted Intervention − MICCAI 2017. Quebec City, QC, Canada 11 - 13 Sep 2017 Springer. https://doi.org/10.1007/978-3-319-66182-7_83

2D-3D Registration Accuracy Estimation for Optimised Planning of Image-Guided Pancreatobiliary Interventions
Hu, Y., Bonmati Coll, E., Gibson, E., Hipwell, J.H., Hawkes, D.J., Bandula, S., Pereira, S.P. and Barratt, D.C. 2016. 2D-3D Registration Accuracy Estimation for Optimised Planning of Image-Guided Pancreatobiliary Interventions. Medical Image Computing and Computer-Assisted Intervention – MICCAI 2016. MICCAI 2016. Athens, Greece 17 - 21 Oct 2016 Springer. https://doi.org/10.1007/978-3-319-46720-7_60

Assessment of Electromagnetic Tracking Accuracy for Endoscopic Ultrasound
Bonmati Coll, E., Hu, Y., Gurusamy, K., Davidson, B., Pereira, S.P., Clarkson, M.J. and Barratt, D.C. 2016. Assessment of Electromagnetic Tracking Accuracy for Endoscopic Ultrasound. Computer-Assisted and Robotic Endoscopy. CARE 2016. Athens, Greece 17 Oct 2016 Springer. https://doi.org/10.1007/978-3-319-54057-3_4

Measuring Complex Brain Networks Structure
Bonmati Ester, Bardera Anton, Boada Imma and Bonmati Coll, E. 2016. Measuring Complex Brain Networks Structure. Frontiers in Neuroinformatics. Conference Abstract: Neuroinformatics 2016. https://doi.org/10.3389/conf.fninf.2016.20.00012

Hierarchical clustering based on the information bottleneck method using a control process
Bonmati Coll, E., Bardera, A., Boada, I., Feixas, M. and Sbert, M. 2015. Hierarchical clustering based on the information bottleneck method using a control process. Pattern Analysis and Applications (PAA). 18, pp. 619-637. https://doi.org/10.1007/s10044-015-0467-1

Permalink - https://westminsterresearch.westminster.ac.uk/item/wy65z/competing-for-pixels-a-self-play-algorithm-for-weakly-supervised-semantic-segmentation


Share this

Usage statistics

3 total views
3 total downloads
These values cover views and downloads from WestminsterResearch and are for the period from September 2nd 2018, when this repository was created.