Model Card for Model ID mkang315/PK-YOLO
Model Details
Model Description
- Developed by: Ming Kang
- Funded by Monash University: Faculty of Information Technology (FIT)
- Shared by Ming Kang: Affirom Monash FIT
- Model type: Pretrained medical image model, pretrained brain MRI image weights, brain tumor detection
- License: GNU General Public License 3.0
- Pretrained from models RepViT-M2.3 and Sparse masKed modeling (SparK): Pretrained RepViT via SparK as YOLOv9-E backbone
Model Sources
- Repository: Github
- Paper (published): PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplane MRI Slices
- CVF: WACV 2025 open access provided by the Computer Vision Foundation
- IEEE Xplore: Proceedings of the 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2025)
- ArXiv: arxiv:2410.21822
- Daily Papers: hf:2410.21822
Uses
Installation
Install requirements.txt in a Python>=3.8.0 environment, including PyTorch>=1.7.0.
pip install -r requirements.txt
Direct Use
python train_dual.py
Citation
Please cite the paper if using this model. Here is a guide to referencing this work in various styles for formatting your references:
BibTeX:
\begin{thebibliography}{1}
\bibitem{Kang25Pkyolo} M. Kang, F. F. Ting, R. C.-W. Phan, and C.-M. Ting, "Pk-yolo: Pretrained knowledge guided yolo for brain tumor detection in multiplane mri slices," in {\emph Proc. Winter Conf. Appl. Comput. Vis. (WACV)}, Tucson, AZ, USA, Feb. 28--Mar. 4, 2025, pp. 3732--3741.
\end{thebibliography}
@inproceedings{Kang25Pkyolo,
author = "Ming Kang and Fung Fung Ting and Rapha{\"e}l C.-W. Phan and Chee-Ming Ting",
title = "Pk-yolo: Pretrained knowledge guided yolo for brain tumor detection in multiplane mri slices",
booktitle = "Proc. Winter Conf. Appl. Comput. Vis. (WACV)",
% booktitle = WACV, %% IEEE Full Name Reference Style
address = "Tucson, AZ, USA, Feb. 28--Mar. 4",
pages = "3732--3741",
year = "2025"
}
@inproceedings{Kang25Pkyolo,
author = "Kang, Ming and Ting, Fung Fung and Phan, Rapha{\"e}l C.-W. and Ting, Chee-Ming",
title = "{PK-YOLO}: pretrained knowledge guided {YOLO} for brain tumor detection in multiplane {MRI} slices",
editor = "",
booktitle = "2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)",
series = "",
volume = "",
pages = "3732--3741",
publisher = "IEEE",
address = "Piscataway",
year = "2025",
doi= "10.1109/WACV61041.2025.00367",
url = "https://doi.org/10.1109/WACV61041.2025.00367"
}
IEEE Full Name Reference Style:
Ming Kang, Fung Fung Ting, Raphaël C.-W. Phan, and Chee-Ming Ting. Pk-yolo: Pretrained knowledge guided yolo for brain tumor detection in multiplane mri slices. In WACV, pages 3732–3741, 2025.
NOTE: This is a modification to the standard IEEE Reference Style and used by most IEEE/CVF conferences, including CVPR, ICCV, and WACV, to render first names in the bibliography as "Firstname Lastname" rather than "F. Lastname" or "Lastname, F.", which the reference styles of NeurIPS, ICLR, and IJCAI are similar to.
IEEE Reference Style:
M. Kang, F. F. Ting, R. C.-W. Phan, and C.-M. Ting, "Pk-yolo: Pretrained knowledge guided yolo for brain tumor detection in multiplane mri slices," in Proc. Winter Conf. Appl. Comput. Vis. (WACV), Tucson, AZ, USA, Feb. 28–Mar. 4, 2025, pp. 3732–3741.
NOTE: City of Conf., Abbrev. State, Country, Month & Day(s) are optional.
Nature Reference Style:
Kang, M., Ting, C.-M., Ting, F. F. & Phan, R. C.-W. PK-YOLO: pretrained knowledge guided YOLO for brain tumor detection in multiplane MRI slices. In 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 3732–3741 (IEEE, 2025).
Springer Reference Style:
Kang, M., Ting, F.F., Phan, R.C.-W., Ting, C.-M.: PK-YOLO: pretrained knowledge guided YOLO for brain tumor detection in multiplane MRI slices. In: 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pp. 3732–3741. IEEE, Piscataway (2025)
NOTE: MICCAI conference proceedings are part of the book series LNCS in which Springer's format for bibliographical references is strictly enforced. This is important, for instance, when citing previous MICCAI proceedings. LNCS stands for Lecture Notes in Computer Science.
Elsevier Numbered Style:
M. Kang, F.F. Ting, R.C.-W. Phan, C.-M. Ting, PK-YOLO: pretrained knowledge guided YOLO for brain tumor detection in multiplane MRI slices, in: Proceedings of the 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2025, pp. 3732–3741.
NOTE: Day(s) Month Year, City, Abbrev. State, Country of Conference, Publiser, and Place of Publication are optional and omitted.
Elsevier Name–Date (Harvard) Style:
Kang, M., Ting, F.F., Phan, R.C.-W., Ting, C.-M., 2025. PK-YOLO: pretrained knowledge guided YOLO for brain tumor detection in multiplane MRI slice. In: Proceedings of the 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 28 Februray–4 March 2025, Tucson, AZ, USA. IEEE, Piscataway, New York, USA, pp. 3732–3741.
NOTE: Day(s) Month Year, City, Abbrev. State, Country of Conference, Publiser, and Place of Publication are optional.
APA7: Kang, M., Ting, F.F., Phan, R.C.-W., & Ting, C.-M. (2025). PK-YOLO: Pretrained knowledge guided YOLO for brain tumor detection in multiplane MRI slice. In Proceedings of the 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (pp. 3732–3741). IEEE. https://doi.org/10.1109/WACV61041.2025.00367
Model Card Author
Ming Kang
Model Card Contact
- Downloads last month
- -
Model tree for mkang315/PK-YOLO
Base model
timm/repvit_m2_3.dist_300e_in1k