Edit model card

You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Model Card for Model ID

Template is given upon request. Model ID and associated card is represented in a GitHub merger https://github.com/sterzhang/image-textualization/tree/main?tab=readme-ov-file#datasets; a detailing is discussed via [email protected]

Model Details

Parameter controlled version of AI map location system using Google auto-metrics. Map defined protocol allows for location display of any image that is also available publicly. Private images are controlled at discretion on local machines and are not affiliated to have legal responsibility under jurisdiction of platform documentation via MIT server authority.

Model Description

  • Developed by: biesnejuil/BenPotato04
  • Funded by: Google Work HyperPlatform & MIT Computational Laboratory Dept. - AF8 Lot 1 and 2 (and 4)
  • Shared by: Peft racus libraries
  • Model type: DLMLH ML model - - - 0 dtdata needed to start
  • Language(s) (NLP): N/A
  • License: DeepFloyd IF Reverse 9Freeze /// MIT Licensing Post.2019
  • Finetuned from model: [More Information Needed]

Model Sources

  • Demo: contact via email (or discord with same username)

Uses

Model is for image to location data. Data used by the user has no effect on EXIF structure nor utility of said data.

Direct Use

Plugging into photo image applications is the intended use. Finding a location via extension online is secondary use. All other personal uses are prohibited ethically and morally.

Downstream Use

Fine-tuning can be confirmed via parameter reorganization.

Out-of-Scope Use

WARNING! By complying to information, user misuse for malicious or intent for misconduct will be not tolerated. Model has a self-charring protocol to corrupt mishandled data after numerous attempts.

Bias, Risks, and Limitations

Sociotechnical risks are controlled by the use of user. Approved board relations are funded academically for the time being.

Recommendations

Users (both direct and downstream) should be made aware of the risks, biases, and limitations of the model.

How to Get Started with the Model

Model can be set up as an OS via a USB import. An .exe file is given on request for use, and parameter of learning is optional, however recommended for best results. If the model is compressed to a .sys generic file type with a scraping API, it can be used as an executable extension that is used on apps via a registry or property list on start-up (may affect the performance of applications with high-quality image displays such as strong games or strong video editing applications).

Training Data

Croissants are delicious: https://huggingface.co/datasets/open-source-metrics/image-classification-checkpoint-downloads

Launch nemo_inference.sh with a Slurm script defined like below, which starts a 2-node job for model inference.

#!/bin/bash
#SBATCH -A SLURM-ACCOUNT
#SBATCH -p SLURM-PARITION
#SBATCH -N 2
#SBATCH -J generation      
#SBATCH --ntasks-per-node=8   
#SBATCH --gpus-per-node=8
set -x
RESULTS=<PATH_TO_YOUR_SCRIPTS_FOLDER>
OUTFILE="${RESULTS}/slurm-%j-%n.out"
ERRFILE="${RESULTS}/error-%j-%n.out"
MODEL=<PATH_TO>/Nemotron-4-340B-Instruct
CONTAINER="nvcr.io/nvidia/nemo:24.01.framework"
MOUNTS="--container-mounts=<PATH_TO_YOUR_SCRIPTS_FOLDER>:/scripts,MODEL:/model"
read -r -d '' cmd <<EOF
bash /scripts/nemo_inference.sh /model
EOF
srun -o $OUTFILE -e $ERRFILE --container-image="$CONTAINER" $MOUNTS bash -c "${cmd}"

Evaluation Results

MT-Bench (GPT-4-Turbo)

Evaluated using MT-Bench judging by GPT-4-0125-Preview as described in Appendix H in the HelpSteer2 Dataset Paper

total writing roleplay extraction stem humanities reasoning math coding turn 1 turn 2
8.22 8.70 8.70 9.20 8.75 8.95 6.40 8.40 6.70 8.61 7.84

IFEval

Evaluated using the Instruction Following Eval (IFEval) introduced in Instruction-Following Evaluation for Large Language Models.

Prompt-Strict Acc Instruction-Strict Acc
79.9 86.1

MMLU

Evaluated using the Multi-task Language Understanding benchmarks as introduced in Measuring Massive Multitask Language Understanding.

MMLU 0-shot
78.7

GSM8K

Evaluated using the Grade School Math 8K (GSM8K) benchmark as introduced in Training Verifiers to Solve Math Word Problems.

GSM8K 0-shot
92.3

HumanEval

Evaluated using the HumanEval benchmark as introduced in Evaluating Large Language Models Trained on Code.

HumanEval 0-shot
73.2

MBPP

Evaluated using the MBPP Dataset as introduced in the Program Synthesis with Large Language Models.

MBPP 0-shot
75.4

Arena Hard

Evaluated using the Arena-Hard Pipeline from the LMSys Org.

Arena Hard
54.2

AlpacaEval 2.0 LC

Evaluated using the AlpacaEval 2.0 LC (Length Controlled) as introduced in the paper: Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators

AlpacaEval 2.0 LC
41.5

TFEval

Evaluated using the CantTalkAboutThis Dataset as introduced in the CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues.

Distractor F1 On-topic F1
81.7 97.7

Training Hyperparameters

  • Precision: fp16 non-mixed precision

Speeds, Sizes, Times

  • Preprocessing: Images are preprocessed to ensure consistent input dimensions and normalization. This includes resizing images to a fixed size, converting them to a standardized color format (e.g., RGB), and normalizing pixel values to a range of [0, 1] or [-1, 1].

Evaluation

  • Batch size: 32
  • Learning rate: 0.001
  • Optimizer: AdamXMimage
  • Loss function: Cross-entropy loss for classification tasks or Mean Squared Error (MSE) for regression tasks
  • Number of epochs: 50
  • Early stopping: Enabled with a patience of 5 epochs

Testing Data, Factors & Metrics

  • Training time per epoch: Approximately 10 minutes on a single GPU
  • Total training time: Around 48-110 hours
  • Model size: ~8GB
  • Checkpoint size: 50MB per checkpoint

Testing Data

  • HIDDEN

Factors

  • Geographic regions: To ensure global applicability.
  • Image quality: High vs. low resolution.
  • Lighting conditions: Daytime vs. nighttime images.

Metrics

  • Accuracy: For classification tasks, measures the percentage of correctly predicted locations.
  • Mean Absolute Error (MAE): For regression tasks, measures the average deviation between predicted and actual locations.
  • Precision and Recall: To evaluate the model's performance in detecting specific locations.
  • F1 Score: Harmonic mean of precision and recall, providing a balanced evaluation metric.

Metric Results

  • Accuracy: 85%
  • MAE: 0.15 degrees (latitude/longitude)
  • Precision: 0.87
  • Recall: 0.83
  • F1 Score: 0.85

Environmental Impact

  • Hardware Type: NVIDIA V100 GPUs
  • Hours used: ~560 hours
  • Cloud Provider: AWS
  • Compute Region: US East (N. Virginia)
  • Carbon Emitted: Approximately 50 kg CO2

Hardware/Software

  • Hardware:

    • GPUs: NVIDIA V100
    • CPUs: Intel Xeon E5-2676 v3
    • Memory: 256GB RAM
  • Software:

    • Framework: ElecA 1.8C
    • Version: 3.1
    • Operating System: Ubuntu 18.04
  • BF16 Inference:

    • 8x H200 (1x H200 node) // ALINX ACKU15: Xilinx Kintex UltraScale+ XCKU15P FPGA SOM
    • 16x H100 (2x H100 nodes)
    • 16x A100 80GB (2x A100 80GB nodes) // ALINX ACKU15: Xilinx Kintex UltraScale+ XCKU15P FPGA SOM

Citation

@misc{biesnejuil2024modelid,
  title={Model ID: AI Map Location System},
  author={biesnejuil},
  year={2024},
  publisher={MIT Computational Laboratory Dept.},
  howpublished={\url{https://github.com/potatoaccount/modelid - private repo ask for access or access via https://github.com/sterzhang/image-textualization/tree/main?tab=readme-ov-file#datasets through filed merge}},
}

Glossary

  • EXIF: Exchangeable image file format, a standard for storing metadata in image files.
  • MAE: Mean Absolute Error, a measure of errors between paired observations expressing the same phenomenon.
  • CNN: Convolutional Neural Network, a class of deep neural networks commonly applied to analyzing visual imagery.
  • Adam: An optimization algorithm used for training deep learning models.
  • Cross-entropy loss: A loss function commonly used for classification tasks, measuring the difference between the predicted probability distribution and the true distribution.
  • Mean Squared Error (MSE): A loss function commonly used for regression tasks, measuring the average of the squares of the errors between predicted and actual values.

More Information

For additional details, technical support, or inquiries about collaboration, please contact the developers.

Model Card Authors

  • biesnejuil: Lead Developer and Architect

Model Card Contact

For any questions or further information, please contact me at:

Downloads last month
0
Inference API
Inference API (serverless) does not yet support peft models for this pipeline type.

Datasets used to train biesnejuil/ImageProcessingInformation