Model Card for Model ID mkang315/PK-YOLO

Model Details

Model Description

  • Developed by: Ming Kang
  • Funded by Monash University: Faculty of Information Technology (FIT)
  • Shared by Ming Kang: Affirom Monash FIT
  • Model type: Pretrained medical image model, pretrained brain MRI image weights, brain tumor detection
  • License: GNU General Public License 3.0
  • Pretrained from models RepViT-M2.3 and Sparse masKed modeling (SparK): Pretrained RepViT via SparK as YOLOv9-E backbone

Model Sources

Uses

Installation

Install requirements.txt in a Python>=3.8.0 environment, including PyTorch>=1.7.0.

pip install -r requirements.txt

Direct Use

python train_dual.py

Citation

Please cite the paper if using this model. Here is a guide to referencing this work in various styles for formatting your references:

BibTeX:

\begin{thebibliography}{1}
\bibitem{Kang25Pkyolo} M. Kang, F. F. Ting, R. C.-W. Phan, and C.-M. Ting, "Pk-yolo: Pretrained knowledge guided yolo for brain tumor detection in multiplane mri slices," in {\emph Proc. Winter Conf. Appl. Comput. Vis. (WACV)}, Tucson, AZ, USA, Feb. 28--Mar. 4, 2025, pp. 3732--3741.
\end{thebibliography}
@inproceedings{Kang25Pkyolo,
  author = "Ming Kang and Fung Fung Ting and Rapha{\"e}l C.-W. Phan and Chee-Ming Ting",
  title = "Pk-yolo: Pretrained knowledge guided yolo for brain tumor detection in multiplane mri slices",
  booktitle = "Proc. Winter Conf. Appl. Comput. Vis. (WACV)",
  % booktitle = WACV, %% IEEE Full Name Reference Style
  address = "Tucson, AZ, USA, Feb. 28--Mar. 4",
  pages = "3732--3741",
  year = "2025"
}
@inproceedings{Kang25Pkyolo,
  author = "Kang, Ming and Ting, Fung Fung and Phan, Rapha{\"e}l C.-W. and Ting, Chee-Ming",
  title = "{PK-YOLO}: pretrained knowledge guided {YOLO} for brain tumor detection in multiplane {MRI} slices",
  editor = "",
  booktitle = "2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)",
  series = "",
  volume = "",
  pages = "3732--3741",
  publisher = "IEEE",
  address = "Piscataway",
  year = "2025",
  doi= "10.1109/WACV61041.2025.00367",
  url = "https://doi.org/10.1109/WACV61041.2025.00367"
}

IEEE Full Name Reference Style: Ming Kang, Fung Fung Ting, Raphaël C.-W. Phan, and Chee-Ming Ting. Pk-yolo: Pretrained knowledge guided yolo for brain tumor detection in multiplane mri slices. In WACV, pages 3732–3741, 2025.
NOTE: This is a modification to the standard IEEE Reference Style and used by most IEEE/CVF conferences, including CVPR, ICCV, and WACV, to render first names in the bibliography as "Firstname Lastname" rather than "F. Lastname" or "Lastname, F.", which the reference styles of NeurIPS, ICLR, and IJCAI are similar to.

IEEE Reference Style: M. Kang, F. F. Ting, R. C.-W. Phan, and C.-M. Ting, "Pk-yolo: Pretrained knowledge guided yolo for brain tumor detection in multiplane mri slices," in Proc. Winter Conf. Appl. Comput. Vis. (WACV), Tucson, AZ, USA, Feb. 28–Mar. 4, 2025, pp. 3732–3741.
NOTE: City of Conf., Abbrev. State, Country, Month & Day(s) are optional.

Nature Reference Style: Kang, M., Ting, C.-M., Ting, F. F. & Phan, R. C.-W. PK-YOLO: pretrained knowledge guided YOLO for brain tumor detection in multiplane MRI slices. In 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 3732–3741 (IEEE, 2025).

Springer Reference Style: Kang, M., Ting, F.F., Phan, R.C.-W., Ting, C.-M.: PK-YOLO: pretrained knowledge guided YOLO for brain tumor detection in multiplane MRI slices. In: 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pp. 3732–3741. IEEE, Piscataway (2025)
NOTE: MICCAI conference proceedings are part of the book series LNCS in which Springer's format for bibliographical references is strictly enforced. This is important, for instance, when citing previous MICCAI proceedings. LNCS stands for Lecture Notes in Computer Science.

Elsevier Numbered Style: M. Kang, F.F. Ting, R.C.-W. Phan, C.-M. Ting, PK-YOLO: pretrained knowledge guided YOLO for brain tumor detection in multiplane MRI slices, in: Proceedings of the 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2025, pp. 3732–3741.
NOTE: Day(s) Month Year, City, Abbrev. State, Country of Conference, Publiser, and Place of Publication are optional and omitted.

Elsevier Name–Date (Harvard) Style: Kang, M., Ting, F.F., Phan, R.C.-W., Ting, C.-M., 2025. PK-YOLO: pretrained knowledge guided YOLO for brain tumor detection in multiplane MRI slice. In: Proceedings of the 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 28 Februray–4 March 2025, Tucson, AZ, USA. IEEE, Piscataway, New York, USA, pp. 3732–3741.
NOTE: Day(s) Month Year, City, Abbrev. State, Country of Conference, Publiser, and Place of Publication are optional.

APA7: Kang, M., Ting, F.F., Phan, R.C.-W., & Ting, C.-M. (2025). PK-YOLO: Pretrained knowledge guided YOLO for brain tumor detection in multiplane MRI slice. In Proceedings of the 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (pp. 3732–3741). IEEE. https://doi.org/10.1109/WACV61041.2025.00367

Model Card Author

Ming Kang

Model Card Contact

[email protected]

Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for mkang315/PK-YOLO

Finetuned
(1)
this model

Dataset used to train mkang315/PK-YOLO