Commit 6216ecd · Parent: 922c280

totally changed the project structure
This view is limited to 50 files because it contains too many changes; see the raw diff for the full change set.
- .gitignore +2 -0
- Makefile +0 -2
- README.md +65 -48
- command.py +0 -40
- config.yaml → configs/experiment1.yaml +9 -8
- constants.yaml +3 -0
- docs/Makefile +0 -153
- docs/commands.rst +0 -10
- docs/conf.py +0 -244
- docs/getting-started.rst +0 -6
- docs/index.rst +0 -24
- docs/make.bat +0 -190
- references/.gitkeep +0 -0
- reports/.gitkeep +0 -0
- reports/figures/.gitkeep +0 -0
- requirements.txt +2 -1
- setup.py +1 -1
- src/__init__.py +0 -9
- src/data/.gitkeep +0 -0
- src/data/__init__.py +0 -0
- src/data/make_dataset.py +0 -128
- src/data/visualize_dataset.py +0 -52
- src/features/.gitkeep +0 -0
- src/features/__init__.py +0 -0
- src/features/build_features.py +0 -0
- src/models/.gitkeep +0 -0
- src/models/__init__.py +0 -0
- src/models/predict_model.py +0 -0
- src/models/train_model.py +0 -0
- src/scripts/create_sub_task.py +274 -0
- src/scripts/prepare_dataset.py +31 -0
- src/scripts/visualize_dataset.py +29 -0
- src/simple_regression_colorization/data/datasets/forests.py +79 -0
- src/simple_regression_colorization/data/register_datasets.py +4 -0
- src/simple_regression_colorization/data/visualize_dataset.py +21 -0
- src/simple_regression_colorization/model/base_model_interface.py +14 -0
- src/simple_regression_colorization/model/callbacks.py +1 -0
- src/{data/load_dataset.py → simple_regression_colorization/model/dataloaders.py} +10 -22
- src/simple_regression_colorization/model/losses.py +1 -0
- src/simple_regression_colorization/model/metrics.py +1 -0
- src/simple_regression_colorization/model/models/model_v1.py +30 -0
- src/simple_regression_colorization/model/register_models.py +2 -0
- src/simple_regression_colorization/scripts/create_dataset.py +67 -0
- src/simple_regression_colorization/scripts/create_model.py +46 -0
- src/simple_regression_colorization/validate_config.py +22 -0
- src/utils.py +0 -39
- src/utils/config_loader.py +23 -0
- src/utils/data_utils.py +77 -0
- src/utils/script_utils.py +47 -0
- src/visualization/.gitkeep +0 -0
.gitignore
CHANGED
@@ -87,3 +87,5 @@ target/
 
 # Mypy cache
 .mypy_cache/
+
+/models
Makefile
DELETED
@@ -1,2 +0,0 @@
-.PHONY: clean data lint requirements sync_data_to_s3 sync_data_from_s3
-
README.md
CHANGED
@@ -1,59 +1,76 @@

Removed: the default cookiecutter "Image Colorization" header and the old project-organization tree (LICENSE, Makefile, README.md, data, and further entries that are blank in this view).

Added:

## Image Colorization
==============================

A deep learning based Image Colorization project.

## FINDINGS

- the task we want to learn is `image-colorization`, but we can accomplish it through different kinds of tasks, which I call **sub-tasks**; in our context these could be `regression based image colorization`, `classification (by binning) based colorization`, `GAN based colorization`, or `image colorization + scene classification` (the "Let there be Color" research paper did this).
- from analysis while trying to come up with a project file structure, I found that the data, model, loss, metrics, and dataloader are all tightly coupled when dealing with a particular task (`image-colorization`), but within a **sub-task** we have much more freedom.
- within a sub-task (e.g., regression-unet-learner) we have already fixed a set of rules, so we can use different models without changing the data, or change datasets while keeping the same model, **so it is important to fix the sub-task we want to do first.**
- so making a folder for each sub-task seems right, as a sub-task has high cohesion and no coupling with any other sub-task.

## RULES

- use **lower_snake_case** for **functions**
- use **lower_snake_case** for **file & folder names**
- use **UpperCamelCase** for **class names**
- **sub-task** names should be in **lower-kebab-case**

## Project File Structure

------------

.
├── LICENSE
├── README.md                <- The top-level README for developers using this project.
├── data/
│   ├── external             <- Data from third party sources.
│   ├── interim              <- Intermediate data that has been transformed.
│   ├── processed            <- The final, canonical data sets for modeling.
│   └── raw                  <- The original, immutable data dump.
├── models/                  <- Trained models
├── notebooks/               <- Jupyter notebooks
├── configs/
│   ├── experiment1.yaml
│   ├── experiment2.yaml
│   ├── experiment3.yaml
│   └── ...
└── src/
    ├── sub_task_1/
    │   ├── validate_config.py
    │   ├── data/
    │   │   ├── register_datasets.py
    │   │   ├── datasets/
    │   │   │   ├── dataset1.py
    │   │   │   └── dataset2.py
    │   ├── model/
    │   │   ├── base_model_interface.py
    │   │   ├── register_models.py
    │   │   ├── models/
    │   │   │   ├── simple_model.py
    │   │   │   └── complex_model.py
    │   │   ├── losses.py
    │   │   ├── metrics.py
    │   │   ├── callbacks.py
    │   │   └── dataloader.py
    │   └── scripts/
    │       ├── create_dataset.py
    │       └── create_model.py
    ├── sub_task_2/
    │   └── ...
    ├── sub_task_3/
    │   └── ...
    ├── scripts/
    │   ├── create_sub_task.py
    │   ├── prepare_dataset.py
    │   ├── visualize_dataset.py
    │   ├── visualize_results.py
    │   ├── train.py
    │   ├── evaluate.py
    │   └── inference.py
    └── utils/
        ├── data_utils.py
        └── model_utils.py

--------

<p><small>Project based on the <a target="_blank" href="https://drivendata.github.io/cookiecutter-data-science/">cookiecutter data science project template</a>. #cookiecutterdatascience</small></p>
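The tree above relies on a registration pattern: each sub-task's register_datasets.py and register_models.py hold a plain list of names (the templates generated by create_sub_task.py later in this diff show exactly that), and an experiment config picks one entry of each. A minimal sketch of how a top-level script could turn those registered names into imported modules; the resolve() helper below is hypothetical and not part of this commit:

# Hypothetical helper, not part of this commit: maps the config's
# dataset/model names onto modules inside the chosen sub-task package.
import importlib

def resolve(task: str, kind: str, name: str):
    if kind == "dataset":
        registered = importlib.import_module(f"src.{task}.data.register_datasets").datasets
        package = f"src.{task}.data.datasets"
    elif kind == "model":
        registered = importlib.import_module(f"src.{task}.model.register_models").models
        package = f"src.{task}.model.models"
    else:
        raise ValueError(f"unknown kind {kind!r}")
    if name not in registered:
        raise ValueError(f"{name!r} is not registered for sub-task {task!r}")
    return importlib.import_module(f"{package}.{name}")

# e.g. resolve("simple_regression_colorization", "model", "model_v1")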
command.py
DELETED
@@ -1,40 +0,0 @@

import argparse
import sys
import os

# parser = argparse.ArgumentParser()
# parser.add_argument("category")
# parser.add_argument("subcommand-args")
# args = parser.parse_args()
args = sys.argv

# remove "command.py"
args = args[1:]

# print(args)
subcommand = args[0].lower()

subcommand_args = " ".join(args[1:])
if subcommand=="data":
    command = "py src/data/make_dataset.py "+subcommand_args
    # print(command)
    os.system(command)
else:
    print("subcommand not supported.")

# os.system("py src/__init__.py")
"""
download the dataset: data download
preprocess dataset: data prepare
visualize dataset: data show
delete raw & interim dataset dir: data delete --cache
delete all dataset dir: data delete --all


train model: model train
evaluate model: model evaluate
inference with model: model predict --image test.jpg --folder images/ -d results/
"""
config.yaml → configs/experiment1.yaml
RENAMED
@@ -1,13 +1,14 @@

Old (config.yaml), as far as it is recoverable in this view:

processed_dataset_dir: data/processed/

# forests or pascal-voc
dataset: forests

train_size: 0.8
shuffle: False
batch_size: 16

New (configs/experiment1.yaml):

# mandatory
task: simple_regression_colorization
dataset: forests
model: model_v1

# common parameters
seed: 324
train_size: 0.8
image_size: 224
shuffle: False

# training related
batch_size: 16
epochs: 10
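This commit also adds src/simple_regression_colorization/validate_config.py and puts cerberus into requirements.txt, but the validator's contents are not shown in this view. A guess at what such a check could look like with cerberus, using the keys from experiment1.yaml above; the schema, function name, and the PyYAML dependency are assumptions:

# Hypothetical sketch; the real validate_config.py added in this commit is not shown here.
import yaml
from cerberus import Validator

SCHEMA = {
    "task": {"type": "string", "required": True},       # mandatory keys
    "dataset": {"type": "string", "required": True},
    "model": {"type": "string", "required": True},
    "seed": {"type": "integer"},                         # common parameters
    "train_size": {"type": "float", "min": 0.0, "max": 1.0},
    "image_size": {"type": "integer"},
    "shuffle": {"type": "boolean"},
    "batch_size": {"type": "integer"},                   # training related
    "epochs": {"type": "integer"},
}

def validate_config(path="configs/experiment1.yaml"):
    with open(path) as f:
        config = yaml.safe_load(f)
    v = Validator(SCHEMA)
    if not v.validate(config):
        raise ValueError(f"invalid config {path}: {v.errors}")
    return config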
constants.yaml
ADDED
@@ -0,0 +1,3 @@
+RAW_DATASET_DIR: data/raw/
+INTERIM_DATASET_DIR: data/interim/
+PROCESSED_DATASET_DIR: data/processed/
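The fixed dataset paths now live apart from the per-experiment settings, and the commit adds src/utils/config_loader.py (+23 lines, contents not shown in this view). Under the assumption that the loader simply merges the two YAML files, a minimal sketch; the function name and PyYAML dependency are assumptions:

# Hypothetical sketch; the real src/utils/config_loader.py is not shown in this diff.
import yaml

def load_config(experiment_path, constants_path="constants.yaml"):
    """Merge the fixed constants with one experiment's settings."""
    with open(constants_path) as f:
        constants = yaml.safe_load(f)      # RAW_DATASET_DIR, INTERIM_..., PROCESSED_...
    with open(experiment_path) as f:
        experiment = yaml.safe_load(f)     # task, dataset, model, seed, ...
    return {**constants, **experiment}

# usage: cfg = load_config("configs/experiment1.yaml")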
docs/Makefile
DELETED
@@ -1,153 +0,0 @@
Removed in full: the stock Sphinx documentation Makefile (SPHINXBUILD/SPHINXOPTS/BUILDDIR variables and the help, clean, html, dirhtml, singlehtml, pickle, json, htmlhelp, qthelp, devhelp, epub, latex, latexpdf, text, man, texinfo, info, gettext, changes, linkcheck, and doctest targets).
docs/commands.rst
DELETED
@@ -1,10 +0,0 @@

Commands
========

The Makefile contains the central entry points for common tasks related to this project.

Syncing data to S3
^^^^^^^^^^^^^^^^^^

* `make sync_data_to_s3` will use `aws s3 sync` to recursively sync files in `data/` up to `s3://[OPTIONAL] your-bucket-for-syncing-data (do not include 's3://')/data/`.
* `make sync_data_from_s3` will use `aws s3 sync` to recursively sync files from `s3://[OPTIONAL] your-bucket-for-syncing-data (do not include 's3://')/data/` to `data/`.
docs/conf.py
DELETED
@@ -1,244 +0,0 @@
Removed in full: the stock sphinx-quickstart conf.py for "project_name" (version/release 0.1, master_doc 'index', 'sphinx' Pygments style, 'default' HTML theme, and the commented-out default options for HTML, LaTeX, man-page, and Texinfo output).
docs/getting-started.rst
DELETED
@@ -1,6 +0,0 @@

Getting started
===============

This is where you describe how to get set up on a clean install, including the
commands necessary to get the raw data (using the `sync_data_from_s3` command,
for example), and then how to make the cleaned, final data sets.
docs/index.rst
DELETED
@@ -1,24 +0,0 @@

.. project_name documentation master file, created by
   sphinx-quickstart.
   You can adapt this file completely to your liking, but it should at least
   contain the root `toctree` directive.

project_name documentation!
==============================================

Contents:

.. toctree::
   :maxdepth: 2

   getting-started
   commands



Indices and tables
==================

* :ref:`genindex`
* :ref:`modindex`
* :ref:`search`
docs/make.bat
DELETED
@@ -1,190 +0,0 @@
Removed in full: the stock Sphinx make.bat for Windows (SPHINXBUILD/BUILDDIR setup and the same help, clean, html, dirhtml, singlehtml, pickle, json, htmlhelp, qthelp, devhelp, epub, latex, text, man, texinfo, gettext, changes, linkcheck, and doctest targets as docs/Makefile).
references/.gitkeep
DELETED
File without changes

reports/.gitkeep
DELETED
File without changes

reports/figures/.gitkeep
DELETED
File without changes
requirements.txt
CHANGED
@@ -1,3 +1,4 @@
 huggingface_hub
 comet_ml
-scikit-image
+scikit-image
+cerberus
setup.py
CHANGED
@@ -7,4 +7,4 @@ setup(
     description='A short description of the project.',
     author='Your name (or your organization/company/team)',
     license='MIT',
-)
+)
src/__init__.py
DELETED
@@ -1,9 +0,0 @@

from src.utils import Config
from pathlib import Path

config = Config("config.yaml")
# config.raw_dataset_dir = Path(config.raw_dataset_dir)
# config.interim_dataset_dir = Path(config.interim_dataset_dir)
# config.processed_dataset_dir = Path(config.processed_dataset_dir)

# print(config)
src/data/.gitkeep
DELETED
File without changes

src/data/__init__.py
DELETED
File without changes
src/data/make_dataset.py
DELETED
@@ -1,128 +0,0 @@

from huggingface_hub import snapshot_download
import os,sys;sys.path.append(os.getcwd())
from src import config
from src.utils import *
import argparse
from pathlib import Path
from zipfile import ZipFile
from glob import glob
import cv2
import numpy as np
import matplotlib.pyplot as plt
from tqdm import tqdm
import shutil
from src.data.visualize_dataset import visualize_dataset

def download_dataset():
    """Used to download dataset from hugging face
    """
    print_title(f"Downloading {config.dataset} dataset from hugging face")
    snapshot_download(repo_id="Anuj-Panthri/Image-Colorization-Datasets",
                      repo_type="dataset",
                      local_dir=config.raw_dataset_dir,
                      allow_patterns=f"{config.dataset}/*")


def unzip_dataset():
    print_title(f"Unzipping dataset")
    print("Extracting to :",Path(config.interim_dataset_dir)/Path("trainval/"))
    with ZipFile(Path(config.raw_dataset_dir)/Path(f"{config.dataset}/trainval.zip"),"r") as zip:
        zip.extractall(Path(config.interim_dataset_dir)/Path("trainval/"))

    print("Extracting to :",Path(config.interim_dataset_dir)/Path("test/"))
    with ZipFile(Path(config.raw_dataset_dir)/Path(f"{config.dataset}/test.zip"),"r") as zip:
        zip.extractall(Path(config.interim_dataset_dir)/Path("test/"))


def clean_dataset():
    print_title("CLEANING DATASET")
    trainval_dir = Path(config.interim_dataset_dir) / Path("trainval/")
    test_dir = Path(config.interim_dataset_dir) / Path("test/")

    trainval_paths = glob(str(trainval_dir/Path("*")))
    test_paths = glob(str(test_dir/Path("*")))

    print("train,test: ",len(trainval_paths),",",len(test_paths),sep="")

    def clean(image_paths,destination_dir):
        # keep only colour images; grayscale ones are skipped
        if os.path.exists(destination_dir): shutil.rmtree(destination_dir)
        os.makedirs(destination_dir)
        for i in tqdm(range(len(image_paths))):
            img = cv2.imread(image_paths[i])
            img = cv2.resize(img,[128,128])
            if not is_bw(img):
                shutil.copy(image_paths[i],
                            destination_dir)
        print("saved to:",destination_dir)

    destination_dir = Path(config.processed_dataset_dir)/Path("trainval/")
    clean(trainval_paths,destination_dir)

    destination_dir = Path(config.processed_dataset_dir)/Path("test/")
    clean(test_paths,destination_dir)

    trainval_dir = Path(config.processed_dataset_dir) / Path("trainval/")
    test_dir = Path(config.processed_dataset_dir) / Path("test/")

    trainval_paths = glob(str(trainval_dir/Path("*")))
    test_paths = glob(str(test_dir/Path("*")))

    print("after cleaning train,test: ",len(trainval_paths),",",len(test_paths),sep="")


def prepare_dataset():
    print_title(f"Preparing dataset")
    download_dataset()
    unzip_dataset()
    clean_dataset()

def delete_cache():
    ## clean old interim and raw datasets
    print_title("deleting unused raw and interim dataset dirs")
    if os.path.exists(config.raw_dataset_dir):
        shutil.rmtree(config.raw_dataset_dir)
    if os.path.exists(config.interim_dataset_dir):
        shutil.rmtree(config.interim_dataset_dir)

def delete_all():
    ## clean all datasets
    print_title("deleting all dataset dirs")
    if os.path.exists(config.raw_dataset_dir):
        shutil.rmtree(config.raw_dataset_dir)
    if os.path.exists(config.interim_dataset_dir):
        shutil.rmtree(config.interim_dataset_dir)
    if os.path.exists(config.processed_dataset_dir):
        shutil.rmtree(config.processed_dataset_dir)


if __name__=="__main__":
    parser = argparse.ArgumentParser()
    parser.add_argument("command")
    parser.add_argument("-d","--dataset",default="forests")
    parser.add_argument("--cache",action="store_true",default=True)
    parser.add_argument("--all",action="store_true")

    """
    prepare dataset: data prepare
    visualize dataset: data show
    delete raw & interim dataset dir: data delete --cache
    delete all dataset dir: data delete --all
    """

    args = parser.parse_args()
    # print(args)

    if args.command=="prepare":
        prepare_dataset()

    elif args.command=="show":
        visualize_dataset()

    elif args.command=="delete":
        if(args.all): delete_all()
        elif(args.cache): delete_cache()

    else:
        print("unsupported")
src/data/visualize_dataset.py
DELETED
@@ -1,52 +0,0 @@

import os,sys;sys.path.append(os.getcwd())
from src.data.load_dataset import get_ds,get_datasets
from src import config
from src.utils import *
import matplotlib.pyplot as plt
import numpy as np
import cv2
import math

def see_batch(L_batch,AB_batch,show_L=False,cols=4,row_size=5,col_size=5,title=None):
    # plot one batch of LAB images as an RGB grid
    n = L_batch.shape[0]
    rows = math.ceil(n/cols)
    fig = plt.figure(figsize=(col_size*cols,row_size*rows))
    if title:
        plt.title(title)
    plt.axis("off")

    for i in range(n):
        fig.add_subplot(rows,cols,i+1)
        L,AB = L_batch[i],AB_batch[i]
        L,AB = rescale_L(L), rescale_AB(AB)
        # print(L.shape,AB.shape)
        img = np.concatenate([L,AB],axis=-1)
        img = cv2.cvtColor(img,cv2.COLOR_LAB2RGB)*255
        # print(img.min(),img.max())
        if show_L:
            L = np.tile(L,(1,1,3))/100*255
            img = np.concatenate([L,img],axis=1)
        plt.imshow(img.astype("uint8"))
    plt.show()


def visualize_dataset():
    train_ds,val_ds,test_ds = get_datasets()
    L_batch,AB_batch = next(iter(train_ds))
    L_batch,AB_batch = L_batch.numpy(), AB_batch.numpy()
    see_batch(L_batch,
              AB_batch,
              title="training dataset")

    L_batch,AB_batch = next(iter(val_ds))
    L_batch,AB_batch = L_batch.numpy(), AB_batch.numpy()
    see_batch(L_batch,
              AB_batch,
              title="validation dataset")

    L_batch,AB_batch = next(iter(test_ds))
    L_batch,AB_batch = L_batch.numpy(), AB_batch.numpy()
    see_batch(L_batch,
              AB_batch,
              title="testing dataset")
src/features/.gitkeep
DELETED
File without changes

src/features/__init__.py
DELETED
File without changes

src/features/build_features.py
DELETED
File without changes

src/models/.gitkeep
DELETED
File without changes

src/models/__init__.py
DELETED
File without changes

src/models/predict_model.py
DELETED
File without changes

src/models/train_model.py
DELETED
File without changes
src/scripts/create_sub_task.py
ADDED
@@ -0,0 +1,274 @@

import os,shutil
import argparse

def create_file(file_path,file_content):
    with open(file_path,"w") as f:
        f.write(file_content)

def create_data(data_dir,dataset_name,sub_task_dir):
    # call src/sub_task/scripts/create_dataset.py dataset_name
    os.system(f"python {sub_task_dir}/scripts/create_dataset.py {dataset_name}")

    register_datasets_file_path = os.path.join(data_dir,"register_datasets.py")
    create_file(register_datasets_file_path,
f"""# register your datasets here

datasets = ["{dataset_name}"]

""")


def create_model(model_dir:str, model_name:str, sub_task_dir:str):
    base_model_interface_path = os.path.join(model_dir,"base_model_interface.py")

    create_file(base_model_interface_path,
"""import numpy as np
from abc import ABC, abstractmethod

# BaseModel Abstract class
# all the models within this sub_task must inherit this class

class BaseModel(ABC):
    @abstractmethod
    def train(self):
        pass

    @abstractmethod
    def predict(self,inputs):
        pass
""")

    # call src/sub_task/scripts/create_model.py model_name
    os.system(f"python {sub_task_dir}/scripts/create_model.py {model_name}")

    register_models_path = os.path.join(model_dir,"register_models.py")
    create_file(register_models_path,
f"""# register models of this sub_task here
models = ["{model_name}"]
""")

    losses_path = os.path.join(model_dir,"losses.py")
    create_file(losses_path,
"""# define loss functions here
""")

    metrics_path = os.path.join(model_dir,"metrics.py")
    create_file(metrics_path,
"""# define metrics here
""")

    callbacks_path = os.path.join(model_dir,"callbacks.py")
    create_file(callbacks_path,
"""# define callbacks here
""")

    dataloaders_path = os.path.join(model_dir,"dataloaders.py")
    create_file(dataloaders_path,
"""# define dataloaders here
""")


def create_scripts(scripts_dir,sub_task):
    create_dataset_path = os.path.join(scripts_dir,"create_dataset.py")
    create_file(create_dataset_path,
f"""import os,shutil
import argparse

def create_file(file_path,file_content):
    with open(file_path,"w") as f:
        f.write(file_content)

def create_dataset(args):
    dataset_name = args.name
    force_flag = args.force
    datasets_dir = os.path.join('src','{sub_task}','data','datasets')

    os.makedirs(datasets_dir,exist_ok=True)
    dataset_path = os.path.join(datasets_dir,dataset_name+".py")

    # delete old dataset if force flag exists and dataset already exists
    if os.path.exists(dataset_path):
        if force_flag:
            print("Replacing existing dataset:",dataset_name)
            os.remove(dataset_path)
        else:
            print(f"{{dataset_name}} already exists, use --force flag if you want to reset it to default")
            exit()


    create_file(dataset_path,
\"\"\"# write dataset downloading/preparation code in this file
# Note: download_prepare() is a specially chosen name, so don't change this function's name
# you can add, remove and change any other function in this file

def download_prepare():
    \\"\\"\\" function used to download dataset and apply
    all types of data preprocessing required to prepare the dataset
    \\"\\"\\"
    download_dataset()
    unzip_dataset()
    clean_dataset()
    move_dataset()


def download_dataset():
    \\"\\"\\"download dataset\\"\\"\\"
    pass

def unzip_dataset():
    \\"\\"\\"unzip dataset (if required)\\"\\"\\"
    pass

def clean_dataset():
    \\"\\"\\"clean dataset (if required)\\"\\"\\"
    pass

def move_dataset():
    \\"\\"\\"move dataset to processed folder\\"\\"\\"
    pass
\"\"\")

def main():
    parser = argparse.ArgumentParser(description="Create blueprint dataset")
    parser.add_argument('name',type=str,help="name of dataset (e.g., pascal-voc)")
    parser.add_argument("--force",action="store_true",help="forcefully replace old existing dataset to default",default=False)
    args = parser.parse_args()
    create_dataset(args)

if __name__=="__main__":
    main()

""")

    create_model_path = os.path.join(scripts_dir,"create_model.py")
    create_file(create_model_path,
f"""import os,shutil
import argparse

def create_file(file_path,file_content):
    with open(file_path,"w") as f:
        f.write(file_content)

def create_model(args):
    model_name = args.name
    force_flag = args.force
    models_dir = os.path.join('src','{sub_task}','model',"models")
    os.makedirs(models_dir,exist_ok=True)
    model_path = os.path.join(models_dir,model_name+".py")

    # delete old model if force flag exists and model already exists
    if os.path.exists(model_path):
        if force_flag:
            print("Replacing existing model:",model_name)
            os.remove(model_path)
        else:
            print(f"{{model_name}} already exists, use --force flag if you want to reset it to default")
            exit()


    model_name_camel_case = "".join([part.capitalize() for part in model_name.split("_")])
    create_file(model_path,
f\"\"\"from src.{sub_task}.model.base_model_interface import BaseModel

class Model(BaseModel):
    def train(self):
        pass

    def predict(self,inputs):
        pass
\"\"\")

def main():
    parser = argparse.ArgumentParser(description="Create blueprint model")
    parser.add_argument('name',type=str,help="name of model (e.g., model_v2)")
    parser.add_argument("--force",action="store_true",help="forcefully replace old existing model to default",default=False)
    args = parser.parse_args()
    create_model(args)

if __name__=="__main__":
    main()

""")


def create_sub_task(args):
    """Used to create a sub_task within our main task"""
    sub_task = args.sub_task
    force_flag = args.force
    dataset_name = "dataset1"
    model_name = "model1"

    sub_task_dir = os.path.join('src',sub_task)
    data_dir = os.path.join(sub_task_dir,'data')
    model_dir = os.path.join(sub_task_dir,'model')
    scripts_dir = os.path.join(sub_task_dir,"scripts")
    # print(scripts_dir)
    # delete old sub_task if force flag exists and sub_task already exists
    if os.path.exists(sub_task_dir):
        if force_flag:
            print("Replacing existing sub_task:",sub_task)
            shutil.rmtree(sub_task_dir)
        else:
            print(f"{sub_task} already exists, use --force flag if you want to reset it to default")
            exit()

    # create empty folders
    os.makedirs(sub_task_dir,exist_ok=True)
    os.makedirs(data_dir,exist_ok=True)
    os.makedirs(model_dir,exist_ok=True)
    os.makedirs(scripts_dir,exist_ok=True)
|
| 226 |
+
|
| 227 |
+
# make config validator file
|
| 228 |
+
validate_config_file_path = os.path.join(sub_task_dir,"validate_config.py")
|
| 229 |
+
create_file(validate_config_file_path,
|
| 230 |
+
'''# from cerberus import Validator
|
| 231 |
+
|
| 232 |
+
# write config file schema here
|
| 233 |
+
# based on cerberus Validator
|
| 234 |
+
|
| 235 |
+
schema = {
|
| 236 |
+
"seed": {
|
| 237 |
+
"type": "integer",
|
| 238 |
+
},
|
| 239 |
+
"image_size": {"type": "integer", "required": True},
|
| 240 |
+
"train_size": {"type": "float", "required": True},
|
| 241 |
+
"shuffle": {"type": "boolean", "required": True},
|
| 242 |
+
"batch_size": {
|
| 243 |
+
"type": "integer",
|
| 244 |
+
"required": True,
|
| 245 |
+
},
|
| 246 |
+
"epochs": {
|
| 247 |
+
"type": "integer",
|
| 248 |
+
"required": True,
|
| 249 |
+
},
|
| 250 |
+
}
|
| 251 |
+
|
| 252 |
+
''')
|
| 253 |
+
|
| 254 |
+
# make scripts files
|
| 255 |
+
create_scripts(scripts_dir,sub_task)
|
| 256 |
+
|
| 257 |
+
# make data files
|
| 258 |
+
create_data(data_dir,dataset_name,sub_task_dir)
|
| 259 |
+
|
| 260 |
+
# make model files
|
| 261 |
+
create_model(model_dir,model_name,sub_task_dir)
|
| 262 |
+
|
| 263 |
+
|
| 264 |
+
def main():
|
| 265 |
+
parser = argparse.ArgumentParser(description="Create blueprint sub_task")
|
| 266 |
+
parser.add_argument('sub_task',type=str,help="sub_task of project (e.g., simple_regression_colorization)")
|
| 267 |
+
parser.add_argument("--force",action="store_true",help="forcefully replace old existing sub_task to default",default=False)
|
| 268 |
+
args = parser.parse_args()
|
| 269 |
+
|
| 270 |
+
create_sub_task(args)
|
| 271 |
+
|
| 272 |
+
if __name__=="__main__":
|
| 273 |
+
main()
|
| 274 |
+
|
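For orientation, a minimal sketch of how this scaffolding script is meant to be invoked; the sub_task name below is only an example, and the script is assumed to be run from the repository root:

    # hypothetical usage: scaffold a new sub_task from the repository root
    import subprocess

    subprocess.run(
        ["python", "src/scripts/create_sub_task.py", "gan_colorization", "--force"],
        check=True,
    )
    # this creates src/gan_colorization/ with data/, model/, scripts/,
    # a validate_config.py stub, and blueprint "dataset1"/"model1" files
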
src/scripts/prepare_dataset.py
ADDED
@@ -0,0 +1,31 @@
+import argparse
+from src.utils.config_loader import Config
+from src.utils import config_loader
+from src.utils.script_utils import validate_config
+import importlib
+
+
+def prepare_dataset(args):
+    config_file_path = args.config_file
+    config = Config(config_file_path)
+
+    # validate config
+    validate_config(config)
+
+    # set config globally
+    config_loader.config = config
+
+    # now prepare the dataset
+    download_prepare = importlib.import_module(f"src.{config.task}.data.datasets.{config.dataset}").download_prepare
+    print("Preparing dataset")
+    download_prepare()
+    print("Prepared dataset")
+
+def main():
+    parser = argparse.ArgumentParser(description="Prepare dataset based on config yaml file")
+    parser.add_argument("config_file",type=str)
+    args = parser.parse_args()
+    prepare_dataset(args)
+
+if __name__=="__main__":
+    main()

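The script never imports a dataset module directly; it builds the module path from the config's task and dataset fields. A rough sketch of that lookup, using example values that match the registered forests dataset:

    # rough sketch of the dynamic lookup performed above (values are examples)
    import importlib

    task, dataset = "simple_regression_colorization", "forests"
    module = importlib.import_module(f"src.{task}.data.datasets.{dataset}")
    module.download_prepare()  # every dataset module must expose download_prepare()
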
src/scripts/visualize_dataset.py
ADDED
@@ -0,0 +1,29 @@
+import argparse
+from src.utils.config_loader import Config
+from src.utils import config_loader
+from src.utils.script_utils import validate_config
+import importlib
+
+
+def visualize_dataset(args):
+    config_file_path = args.config_file
+    config = Config(config_file_path)
+
+    # validate config
+    validate_config(config)
+
+    # set config globally
+    config_loader.config = config
+
+    # now visualize the dataset
+    visualize_fn = importlib.import_module(f"src.{config.task}.data.visualize_dataset").visualize
+    visualize_fn()
+
+def main():
+    parser = argparse.ArgumentParser(description="Visualize dataset based on config yaml file")
+    parser.add_argument("config_file",type=str)
+    args = parser.parse_args()
+    visualize_dataset(args)
+
+if __name__=="__main__":
+    main()

src/simple_regression_colorization/data/datasets/forests.py
ADDED
@@ -0,0 +1,80 @@
+from src.utils.data_utils import download_personal_hf_dataset,unzip_file,is_bw,print_title
+from zipfile import ZipFile
+from pathlib import Path
+from src.utils.config_loader import constants
+from glob import glob
+import shutil,os
+from tqdm import tqdm
+import cv2
+
+
+
+# write dataset downloading/preparation code in this file
+# Note: download_prepare() is a specially chosen name, so don't change this function's name
+# you can add, remove and change any other function from this file
+
+def download_prepare():
+    """ function used to download dataset and apply
+    all types of data preprocessing required to prepare the dataset
+    """
+    download_dataset()
+    unzip_dataset()
+    clean_dataset()
+
+
+def download_dataset():
+    """Used to download dataset from hugging face"""
+    print_title(f"Downloading forests dataset from hugging face")
+    # download_hf_dataset("")
+    download_personal_hf_dataset("forests")
+
+
+
+def unzip_dataset():
+    print_title(f"Unzipping dataset")
+
+    unzip_file(constants.RAW_DATASET_DIR/Path("forests/trainval.zip"),
+               constants.INTERIM_DATASET_DIR/Path("trainval/"))
+
+    unzip_file(constants.RAW_DATASET_DIR/Path("forests/test.zip"),
+               constants.INTERIM_DATASET_DIR/Path("test/"))
+
+
+
+def clean_dataset():
+    print_title("CLEANING DATASET")
+    trainval_dir = constants.INTERIM_DATASET_DIR / Path("trainval/")
+    test_dir = constants.INTERIM_DATASET_DIR / Path("test/")
+
+    trainval_paths = glob(str(trainval_dir/Path("*")))
+    test_paths = glob(str(test_dir/Path("*")))
+
+    print("train,test: ",len(trainval_paths),",",len(test_paths),sep="")
+
+
+    def clean(image_paths,destination_dir):
+        if os.path.exists(destination_dir): shutil.rmtree(destination_dir)
+        os.makedirs(destination_dir)
+        for i in tqdm(range(len(image_paths))):
+            img = cv2.imread(image_paths[i])
+            img = cv2.resize(img,[128,128])
+            if not is_bw(img):
+                # copy from image_paths (not trainval_paths) so the test split is handled correctly
+                shutil.copy(image_paths[i],
+                            destination_dir)
+        print("saved to:",destination_dir)
+
+    destination_dir = constants.PROCESSED_DATASET_DIR/Path("trainval/")
+    clean(trainval_paths,destination_dir)
+
+    destination_dir = constants.PROCESSED_DATASET_DIR/Path("test/")
+    clean(test_paths,destination_dir)
+
+    trainval_dir = constants.PROCESSED_DATASET_DIR / Path("trainval/")
+    test_dir = constants.PROCESSED_DATASET_DIR / Path("test/")
+
+    trainval_paths = glob(str(trainval_dir/Path("*")))
+    test_paths = glob(str(test_dir/Path("*")))
+
+    print("after cleaning train,test: ",len(trainval_paths),",",len(test_paths),sep="")

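Taken together, download_prepare() moves the forests data through the three directories defined in constants.yaml; a short sketch of the flow (the call below assumes the constants file and network access are available):

    # sketch of the forests pipeline stages
    from src.simple_regression_colorization.data.datasets.forests import download_prepare

    download_prepare()
    # 1. download_dataset(): Hugging Face snapshot lands under constants.RAW_DATASET_DIR/forests/
    # 2. unzip_dataset():    trainval.zip and test.zip are extracted into constants.INTERIM_DATASET_DIR
    # 3. clean_dataset():    near-grayscale images are dropped via is_bw(), the rest go to constants.PROCESSED_DATASET_DIR
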
src/simple_regression_colorization/data/register_datasets.py
ADDED
@@ -0,0 +1,4 @@
+# register your datasets here
+
+datasets = ["forests"]
+
src/simple_regression_colorization/data/visualize_dataset.py
ADDED
@@ -0,0 +1,21 @@
+from src.utils.data_utils import show_images_from_paths
+from src.utils.config_loader import constants,config
+from glob import glob
+import numpy as np
+
+# the data is at constants.PROCESSED_DATASET_DIR/trainval
+#                constants.PROCESSED_DATASET_DIR/test
+
+def visualize():
+    n = 16
+    image_paths = glob(f"{constants.PROCESSED_DATASET_DIR}/trainval/*")
+    chosen_paths = np.random.choice(image_paths,n)
+    show_images_from_paths(chosen_paths,
+                           title="sample of train_val dataset",
+                           image_size=config.image_size)
+
+    image_paths = glob(f"{constants.PROCESSED_DATASET_DIR}/test/*")
+    chosen_paths = np.random.choice(image_paths,n)
+    show_images_from_paths(chosen_paths,
+                           title="sample of test dataset",
+                           image_size=config.image_size)

src/simple_regression_colorization/model/base_model_interface.py
ADDED
@@ -0,0 +1,14 @@
+import numpy as np
+from abc import ABC, abstractmethod
+
+# BaseModel Abstract class
+# all the models within this sub_task must inherit this class
+
+class BaseModel(ABC):
+    @abstractmethod
+    def train(self):
+        pass
+
+    @abstractmethod
+    def predict(self,inputs):
+        pass

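Since BaseModel is an ABC, a model that forgets to override train() or predict() fails at instantiation rather than at call time; a small illustration with a made-up class:

    # illustration only: IncompleteModel is a hypothetical class
    from src.simple_regression_colorization.model.base_model_interface import BaseModel

    class IncompleteModel(BaseModel):
        def train(self):
            pass

    # IncompleteModel()  # raises TypeError because predict() is still abstract
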
src/simple_regression_colorization/model/callbacks.py
ADDED
@@ -0,0 +1 @@
+# define callbacks here

src/{data/load_dataset.py → simple_regression_colorization/model/dataloaders.py}
RENAMED
@@ -1,15 +1,14 @@
-import os,sys;sys.path.append(os.getcwd())
 import tensorflow as tf
-from src import
-from src.utils import
+from src.utils.data_utils import scale_L,scale_AB,rescale_AB,rescale_L
+from src.utils.config_loader import config,constants
 from pathlib import Path
 from glob import glob
 import sklearn.model_selection
 from skimage.color import rgb2lab, lab2rgb
 
 def get_datasets():
-    trainval_dir =
-    test_dir =
+    trainval_dir = constants.PROCESSED_DATASET_DIR / Path("trainval/")
+    test_dir = constants.PROCESSED_DATASET_DIR / Path("test/")
 
     trainval_paths = glob(str(trainval_dir/Path("*")))
     test_paths = glob(str(test_dir/Path("*")))
@@ -22,26 +21,15 @@ def get_datasets():
                                         train_size=0.8,
                                         random_state=324)
 
-    print("train|val
-
-    train_ds =
-    val_ds =
-    test_ds =
+    print("train|val|test:",len(train_paths),"|",len(val_paths),"|",len(test_paths))
+
+    train_ds = get_tf_ds(train_paths,bs=config.batch_size,shuffle=config.shuffle)
+    val_ds = get_tf_ds(val_paths,bs=config.batch_size,shuffle=False,is_val=True)
+    test_ds = get_tf_ds(test_paths,bs=config.batch_size,shuffle=False,is_val=True)
 
     return train_ds,val_ds,test_ds
 
 
-# def test_dataset():
-#     train_ds = get_ds(train_paths,shuffle=False)
-#     L_batch,AB_batch = next(iter(train_ds))
-#     L_batch = L_batch.numpy()
-#     AB_batch = AB_batch.numpy()
-#     print("L:",L_batch.min(),L_batch.max())
-#     print("A:",AB_batch[:,:,:,0].min(),AB_batch[:,:,:,0].max())
-#     print("B:",AB_batch[:,:,:,1].min(),AB_batch[:,:,:,1].max())
-
-
-
 def tf_RGB_TO_LAB(image):
     def f(image):
         image = rgb2lab(image)
@@ -63,7 +51,7 @@ def load_img(img_path):
     L,AB = scale_L(L),scale_AB(AB)
     return L,AB
 
-def
+def get_tf_ds(image_paths,bs=8,shuffle=False,is_val=False):
     ds = tf.data.Dataset.from_tensor_slices(image_paths)
     if shuffle: ds = ds.shuffle(len(image_paths))
     ds = ds.map(load_img,num_parallel_calls=tf.data.AUTOTUNE)

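load_img maps each image into LAB and then squashes the targets into small ranges: L in [0, 100] is divided by 100 and A/B (roughly [-128, 127]) are divided by 128. A standalone sketch of that scaling on dummy data, independent of the tf.data pipeline:

    # standalone sketch of the LAB scaling used by load_img (dummy data, not the real pipeline)
    import numpy as np
    from skimage.color import rgb2lab

    rgb = np.random.rand(64, 64, 3)                    # float RGB in [0, 1]
    lab = rgb2lab(rgb)                                 # L in [0, 100], A/B roughly in [-128, 127]
    L, AB = lab[..., :1] / 100, lab[..., 1:] / 128     # same idea as scale_L / scale_AB
    print(L.min(), L.max(), AB.min(), AB.max())
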
src/simple_regression_colorization/model/losses.py
ADDED
@@ -0,0 +1 @@
+# define loss functions here

src/simple_regression_colorization/model/metrics.py
ADDED
@@ -0,0 +1 @@
+# define metrics here

src/simple_regression_colorization/model/models/model_v1.py
ADDED
@@ -0,0 +1,30 @@
+from src.simple_regression_colorization.model.base_model_interface import BaseModel
+from src.simple_regression_colorization.model.dataloaders import get_datasets
+
+class Model(BaseModel):
+
+    def __init__(self):
+        # make model architecture
+        # load weights (optional)
+        # create dataset loaders
+        # train
+        # predict
+        self.init_model()
+        self.load_weights()
+        self.prepare_data()
+
+
+    def init_model(self):
+        pass
+
+    def load_weights(self,path=None):
+        pass
+
+    def prepare_data(self):
+        self.train_ds,self.val_ds,self.test_ds = get_datasets()
+
+    def train(self):
+        pass
+
+    def predict(self,inputs):
+        pass

src/simple_regression_colorization/model/register_models.py
ADDED
@@ -0,0 +1,2 @@
+# register models of this sub_task here
+models = ["model_v1"]

src/simple_regression_colorization/scripts/create_dataset.py
ADDED
@@ -0,0 +1,67 @@
+import os,shutil
+import argparse
+
+def create_file(file_path,file_content):
+    with open(file_path,"w") as f:
+        f.write(file_content)
+
+def create_dataset(args):
+    dataset_name = args.name
+    force_flag = args.force
+    datasets_dir = os.path.join('src','simple_regression_colorization','data','datasets')
+
+    os.makedirs(datasets_dir,exist_ok=True)
+    dataset_path = os.path.join(datasets_dir,dataset_name+".py")
+
+    # delete the old dataset if the force flag is set and the dataset already exists
+    if os.path.exists(dataset_path):
+        if force_flag:
+            print("Replacing existing dataset:",dataset_name)
+            os.remove(dataset_path)
+        else:
+            print(f"{dataset_name} already exists, use --force flag if you want to reset it to default")
+            exit()
+
+
+    create_file(dataset_path,
+"""# write dataset downloading/preparation code in this file
+# Note: download_prepare() is a specially chosen name, so don't change this function's name
+# you can add, remove and change any other function from this file
+
+def download_prepare():
+    \"\"\" function used to download dataset and apply
+    all types of data preprocessing required to prepare the dataset
+    \"\"\"
+    download_dataset()
+    unzip_dataset()
+    clean_dataset()
+    move_dataset()
+
+
+def download_dataset():
+    \"\"\"download dataset\"\"\"
+    pass
+
+def unzip_dataset():
+    \"\"\"unzip dataset (if required)\"\"\"
+    pass
+
+def clean_dataset():
+    \"\"\"clean dataset (if required)\"\"\"
+    pass
+
+def move_dataset():
+    \"\"\"move dataset to processed folder\"\"\"
+    pass
+""")
+
+def main():
+    parser = argparse.ArgumentParser(description="Create blueprint dataset")
+    parser.add_argument('name',type=str,help="name of dataset (e.g., pascal-voc)")
+    parser.add_argument("--force",action="store_true",help="forcefully replace old existing dataset to default",default=False)
+    args = parser.parse_args()
+    create_dataset(args)
+
+if __name__=="__main__":
+    main()
+
src/simple_regression_colorization/scripts/create_model.py
ADDED
@@ -0,0 +1,46 @@
+import os,shutil
+import argparse
+
+def create_file(file_path,file_content):
+    with open(file_path,"w") as f:
+        f.write(file_content)
+
+def create_model(args):
+    model_name = args.name
+    force_flag = args.force
+    models_dir = os.path.join('src','simple_regression_colorization','model',"models")
+    os.makedirs(models_dir,exist_ok=True)
+    model_path = os.path.join(models_dir,model_name+".py")
+
+    # delete the old model if the force flag is set and the model already exists
+    if os.path.exists(model_path):
+        if force_flag:
+            print("Replacing existing model:",model_name)
+            os.remove(model_path)
+        else:
+            print(f"{model_name} already exists, use --force flag if you want to reset it to default")
+            exit()
+
+
+    model_name_camel_case = "".join([part.capitalize() for part in model_name.split("_")])
+    create_file(model_path,
+f"""from src.simple_regression_colorization.model.base_model_interface import BaseModel
+
+class Model(BaseModel):
+    def train(self):
+        pass
+
+    def predict(self,inputs):
+        pass
+""")
+
+def main():
+    parser = argparse.ArgumentParser(description="Create blueprint model")
+    parser.add_argument('name',type=str,help="name of model (e.g., model_v2)")
+    parser.add_argument("--force",action="store_true",help="forcefully replace old existing model to default",default=False)
+    args = parser.parse_args()
+    create_model(args)
+
+if __name__=="__main__":
+    main()
+
src/simple_regression_colorization/validate_config.py
ADDED
@@ -0,0 +1,22 @@
+# from cerberus import Validator
+
+# write config file schema here
+# based on cerberus Validator
+
+schema = {
+    "seed": {
+        "type": "integer",
+    },
+    "image_size": {"type": "integer", "required": True},
+    "train_size": {"type": "float", "required": True},
+    "shuffle": {"type": "boolean", "required": True},
+    "batch_size": {
+        "type": "integer",
+        "required": True,
+    },
+    "epochs": {
+        "type": "integer",
+        "required": True,
+    },
+}
+
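This schema is consumed by validate_config() in src/utils/script_utils.py; a minimal sketch of checking an example config dict against it with cerberus (the values below are illustrative, not a real experiment file):

    # minimal sketch: validating an example config dict against this schema
    from cerberus import Validator
    from src.simple_regression_colorization.validate_config import schema

    example_config = {"seed": 42, "image_size": 128, "train_size": 0.8,
                      "shuffle": True, "batch_size": 16, "epochs": 10}
    v = Validator(schema, allow_unknown=True)
    print(v.validate(example_config), v.errors)
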
src/utils.py
DELETED
@@ -1,39 +0,0 @@
-import yaml
-import numpy as np
-
-class Config:
-    def __init__(self,path="config.yaml"):
-        with open(path,'r') as f:
-            self.config = yaml.safe_load(f)
-
-    def __str__(self):
-        return str(self.config)
-
-    def __getattr__(self, name: str):
-        return self.config.get(name)
-
-    # def __setattr__(self, name: str, value: any):
-    #     self.config[name]=value
-
-def is_bw(img):
-    rg,gb,rb = img[:,:,0]-img[:,:,1] , img[:,:,1]-img[:,:,2] , img[:,:,0]-img[:,:,2]
-    rg,gb,rb = np.abs(rg).sum(),np.abs(gb).sum(),np.abs(rb).sum()
-    avg = np.mean([rg,gb,rb])
-    # print(rg,gb,rb)
-
-    return avg<10
-
-def print_title(msg:str,n=30):
-    print("="*n,msg.upper(),"="*n,sep="")
-
-def scale_L(L):
-    return L/100
-def rescale_L(L):
-    return L*100
-
-def scale_AB(AB):
-    return AB/128
-
-def rescale_AB(AB):
-    return AB*128
-
src/utils/config_loader.py
ADDED
@@ -0,0 +1,23 @@
+import yaml
+from pathlib import Path
+
+class Config:
+    def __init__(self,config_file_path:str):
+        """loads config from config_file_path"""
+        with open(config_file_path,"r") as f:
+            self.config_dict = yaml.safe_load(f)
+
+    def __str__(self):
+        return str(self.config_dict)
+
+    def __getattr__(self,name):
+        return self.config_dict.get(name)
+
+
+# exports constants
+constants = Config("constants.yaml")
+constants.config_dict['RAW_DATASET_DIR'] = Path(constants.config_dict['RAW_DATASET_DIR'])
+constants.config_dict['INTERIM_DATASET_DIR'] = Path(constants.config_dict['INTERIM_DATASET_DIR'])
+constants.config_dict['PROCESSED_DATASET_DIR'] = Path(constants.config_dict['PROCESSED_DATASET_DIR'])
+
+config = None

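Attribute access on Config falls through __getattr__ to dict.get, so a missing key returns None instead of raising; a short sketch (the YAML path and key names are examples):

    # sketch of Config usage; the path and keys are examples
    from src.utils.config_loader import Config

    cfg = Config("configs/experiment1.yaml")   # any flat YAML file
    print(cfg.batch_size)                      # value from the file
    print(cfg.not_a_key)                       # None, because __getattr__ uses dict.get
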
src/utils/data_utils.py
ADDED
@@ -0,0 +1,79 @@
+from src.utils.config_loader import constants
+from huggingface_hub import snapshot_download
+from zipfile import ZipFile
+import numpy as np
+import shutil
+import matplotlib.pyplot as plt
+import cv2
+import math
+
+
+def download_hf_dataset(repo_id,allow_patterns=None):
+    """Used to download dataset from any public hugging face dataset"""
+    snapshot_download(repo_id=repo_id,
+                      repo_type="dataset",
+                      local_dir=constants.RAW_DATASET_DIR,
+                      allow_patterns=allow_patterns)
+
+
+def download_personal_hf_dataset(name):
+    """Used to download dataset from a specific hugging face dataset"""
+    download_hf_dataset(repo_id="Anuj-Panthri/Image-Colorization-Datasets",
+                        allow_patterns=f"{name}/*")
+
+
+def unzip_file(file_path,destination_dir):
+    """unzips file to destination_dir"""
+    # ignore_errors avoids failing when destination_dir does not exist yet
+    shutil.rmtree(destination_dir,ignore_errors=True)
+    with ZipFile(file_path,"r") as zip:
+        zip.extractall(destination_dir)
+
+def is_bw(img:np.ndarray):
+    """checks if RGB image is black and white"""
+    img = img.astype(int)  # avoid uint8 wraparound when differencing channels
+    rg,gb,rb = img[:,:,0]-img[:,:,1] , img[:,:,1]-img[:,:,2] , img[:,:,0]-img[:,:,2]
+    rg,gb,rb = np.abs(rg).sum(),np.abs(gb).sum(),np.abs(rb).sum()
+    avg = np.mean([rg,gb,rb])
+
+    return avg<10
+
+
+def print_title(msg:str,max_chars=105):
+    n = (max_chars-len(msg))//2
+    print("="*n,msg.upper(),"="*n,sep="")
+
+def scale_L(L):
+    return L/100
+
+def rescale_L(L):
+    return L*100
+
+def scale_AB(AB):
+    return AB/128
+
+def rescale_AB(AB):
+    return AB*128
+
+
+
+def show_images_from_paths(image_paths:list[str],image_size=64,cols=4,row_size=5,col_size=5,show_BW=False,title=None):
+    n = len(image_paths)
+    rows = math.ceil(n/cols)
+    fig = plt.figure(figsize=(col_size*cols,row_size*rows))
+    if title:
+        plt.title(title)
+        plt.axis("off")
+
+    for i in range(n):
+        fig.add_subplot(rows,cols,i+1)
+
+        img = cv2.imread(image_paths[i])[:,:,::-1]
+        img = cv2.resize(img,[image_size,image_size])
+
+        if show_BW:
+            BW = cv2.cvtColor(img,cv2.COLOR_RGB2GRAY)
+            BW = np.tile(BW[:,:,None],(1,1,3))  # stack gray into 3 channels
+            img = np.concatenate([BW,img],axis=1)
+        plt.imshow(img.astype("uint8"))
+    plt.show()

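is_bw sums the absolute differences between the colour channels, so a perfectly grayscale image scores zero and anything under the threshold is treated as black-and-white; a tiny check on synthetic arrays:

    # tiny check of is_bw on synthetic images (arrays are made up)
    import numpy as np
    from src.utils.data_utils import is_bw

    gray = np.full((8, 8, 3), 120, dtype=np.uint8)            # identical channels
    color = np.random.randint(0, 256, (8, 8, 3), dtype=np.uint8)
    print(is_bw(gray), is_bw(color))   # True, almost certainly False
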
src/utils/script_utils.py
ADDED
@@ -0,0 +1,47 @@
+from cerberus import Validator
+import importlib
+import os
+
+def validate_config(config):
+    basic_schema = {
+        "task": {
+            "type":"string",
+            "required":True
+        },
+        "dataset": {
+            "type":"string",
+            "required":True
+        },
+        "model": {
+            "type":"string",
+            "required":True
+        },
+    }
+    basic_v = Validator(basic_schema,allow_unknown=True)
+
+    if not basic_v.validate(config.config_dict):
+        raise Exception(f"Invalid config file:",basic_v.errors)
+
+    # check if such task exists
+    if not os.path.exists(os.path.join("src",config.task)):
+        raise Exception("Invalid config file:",f"no such task {config.task}")
+
+    # check if valid dataset
+    all_datasets = importlib.import_module(f"src.{config.task}.data.register_datasets").datasets
+    if config.dataset not in all_datasets:
+        raise Exception("Invalid config file:",f"no {config.dataset} dataset found in registered datasets: {all_datasets}")
+
+    # check if valid model
+    all_models = importlib.import_module(f"src.{config.task}.model.register_models").models
+    if config.model not in all_models:
+        raise Exception("Invalid config file:",f"no {config.model} model found in registered models: {all_models}")
+
+
+
+    # check the sub_task's validate_config schema
+    task_schema = importlib.import_module(f"src.{config.task}.validate_config").schema
+    sub_task_v = Validator(task_schema,allow_unknown=True)
+
+    if not sub_task_v.validate(config.config_dict):
+        raise Exception(f"Invalid config file:",sub_task_v.errors)

src/visualization/.gitkeep
DELETED
File without changes