Google Cloud commands

BUCKET_NAME=gs://${USER}_yt8m_train_bucket

Training:

The parameters you will usually change are TRAIN_DIR and the flags from base_learning_rate to start_new_model:

TRAIN_DIR: "yt8m_name_simulation" is the name of the simulation (and of the folder where it will be saved). For another simulation, choose another meaningful name that reflects what you are trying to do.
base_learning_rate: The learning rate to start with.
batch_size: Number of samples per training batch.
reg_lambda: Controls the balance between the two loss functions. The larger it is, the more weight the classification loss gets. In my experiments, training only worked correctly when it started at zero; you can change it afterwards.
percentage_negative: Fraction of negative samples (from 0 to 1).
margin: Margin of the cosine loss. Negative pairs whose cosine distance is lower than margin are not penalized, so a higher margin (up to 1) penalizes fewer negative pairs.
start_new_model: Set it to False to continue a previous simulation (one with the same train_dir). Otherwise, the previous simulation is overwritten.
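
As a rough illustration of how reg_lambda and margin interact, here is a minimal sketch in plain Python. It is not the code of CosineAndCrossEntropyLoss in losses.py. It assumes that the total loss is the cosine loss plus reg_lambda times the classification loss, and that "cosine distance" above refers to the cosine similarity used in the usual cosine embedding loss. The function names are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def cosine_margin_loss(audio_emb, video_emb, is_positive, margin):
    """Assumed form of the cosine loss for one (audio, video) pair.

    Positive pairs are pulled together (loss = 1 - similarity).
    Negative pairs are only penalized when their similarity
    exceeds the margin.
    """
    sim = cosine_similarity(audio_emb, video_emb)
    if is_positive:
        return 1.0 - sim
    return max(0.0, sim - margin)


def combined_loss(cosine_loss, classification_loss, reg_lambda):
    """Assumed weighting: reg_lambda scales the classification term."""
    return cosine_loss + reg_lambda * classification_loss
```

Under these assumptions, reg_lambda=0.0 trains on the cosine term alone, which matches the advice to start from zero.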

Other useful modifications can be made in two files:

video_level_models.py, in the EmbeddingModel class. You can change anything you want, as long as the dimensions are correct (audio features are always 128, video features are always 1024). For example, you can add more layers or change the size of the hidden layers.
losses.py, in the CosineAndCrossEntropyLoss class.
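
To make the dimension constraint concrete, the following sketch lists the weight shapes each branch would need. It assumes, based only on the flag names used in the commands (hid_1_audio, hid_2_audio, hid_1_frames, hid_2_frames, embedding_size), that each branch has two hidden layers followed by a projection to a shared embedding size. The real EmbeddingModel may be organized differently, and branch_shapes is a hypothetical helper.

```python
AUDIO_FEATURE_SIZE = 128    # fixed by the dataset (mean_audio)
VIDEO_FEATURE_SIZE = 1024   # fixed by the dataset (mean_rgb)


def branch_shapes(input_size, hidden_sizes, embedding_size):
    """Return the (in, out) weight shape of each layer in one branch."""
    sizes = [input_size] + list(hidden_sizes) + [embedding_size]
    return list(zip(sizes[:-1], sizes[1:]))


# Values taken from the flags in the commands of this README.
audio_shapes = branch_shapes(AUDIO_FEATURE_SIZE, [450, 250], 250)
video_shapes = branch_shapes(VIDEO_FEATURE_SIZE, [2000, 700], 250)

# Both branches must end in the same size so that the two
# embeddings can be compared.
assert audio_shapes[-1][1] == video_shapes[-1][1]
```

Only the input sizes are fixed; the hidden sizes and the number of layers can change as long as consecutive shapes line up and both branches end in the same embedding size.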

TRAIN_DIR=yt8m_name_simulation
JOB_NAME=yt8m_train_$(date +%Y%m%d_%H%M%S); gcloud --verbosity=debug ml-engine jobs submit training $JOB_NAME \
--package-path=youtube-8m --module-name=youtube-8m.train --staging-bucket=$BUCKET_NAME \
--region=us-east1 --config=youtube-8m/cloudml-gpu.yaml \
-- --train_data_pattern='gs://youtube8m-ml-us-east1/1/video_level/train/train*.tfrecord' \
--train_dir=$BUCKET_NAME/${TRAIN_DIR} \
--base_learning_rate=0.0001 \
--batch_size=1024 \
--reg_lambda=0.0 \
--percentage_negative=0.6 \
--margin=0.2 \
--start_new_model=False \
--negative_sampling=True \
--model=EmbeddingModel \
--select_randomly=False \
--feature_names="mean_rgb, mean_audio" \
--feature_sizes="1024, 128" \
--num_readers=8 \
--image_server=False \
--label_loss="CosineAndCrossEntropyLoss" \
--hid_1_audio=450 \
--hid_2_audio=250 \
--hid_1_frames=2000 \
--hid_2_frames=700 \
--embedding_size=250

These are the commands I use to run training on our server; they are very similar. The image_server flag exists because I had some problems reading the files on the server, but in principle you should set it to False:

TRAIN_DIR=yt8m_name_simulation
srun --gres=gpu:1 --mem=2G python train.py --train_data_pattern='path_to_training_data/train*.tfrecord' \
--model=EmbeddingModel \
--select_randomly=False \
--feature_names="mean_rgb, mean_audio" \
--feature_sizes="1024, 128" \
--train_dir=$MODEL_DIR/${TRAIN_DIR} \
--batch_size=254 \
--num_readers=8 \
--start_new_model=True \
--max_steps=30000 \
--image_server=True \
--num_epochs=50 \
--base_learning_rate=0.0005 \
--label_loss="CosineAndCrossEntropyLoss" \
--negative_sampling=True \
--reg_lambda=0.00 \
--margin=0.2 \
--percentage_negative=0.6 \
--export_model_steps=100 \
--learning_rate_decay_examples=1000 \
--learning_rate_decay=0.7 \
--hid_1_audio=450 \
--hid_2_audio=250 \
--hid_1_frames=2000 \
--hid_2_frames=700 \
--embedding_size=250

Evaluation (Validation):

Meaningful parameters:

JOB_TO_EVAL: It has to be the same as the TRAIN_DIR used in training.
batch_size: Number of samples in a batch; the closest embedding is searched for among these.
hits: The "k" in Recall@k.
max_batches: Number of batches to evaluate (the mean result over them is reported).

To create new evaluation metrics, change eval.py and eval_util.py. In eval_util.py you can add a function similar to calculate_hit_at_k_embedding that takes the embeddings of all the samples in the batch as input and evaluates whatever you want. Then find where calculate_hit_at_k_embedding is called and call your new function in the same way.
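
For orientation, this is a minimal plain-Python sketch of a Recall@k metric computed over one batch of paired embeddings. It is not the implementation of calculate_hit_at_k_embedding. The name recall_at_k, its signature, and the assumption that row i of the queries matches row i of the targets are all hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def recall_at_k(query_embs, target_embs, k):
    """Fraction of queries whose true match is among the k closest targets.

    Assumes query_embs[i] and target_embs[i] belong to the same sample.
    """
    hits = 0
    for i, query in enumerate(query_embs):
        sims = [cosine_similarity(query, target) for target in target_embs]
        # Indices of the targets, most similar first.
        ranked = sorted(range(len(sims)), key=lambda j: sims[j], reverse=True)
        if i in ranked[:k]:
            hits += 1
    return hits / len(query_embs)


# Toy batch of three samples with 2-dimensional embeddings.
queries = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0]]
targets = [[1.0, 0.0], [0.0, 1.0], [0.6, 0.8]]
recall_1 = recall_at_k(queries, targets, k=1)
recall_2 = recall_at_k(queries, targets, k=2)
```

In this toy batch the third query ranks its own target second, so it counts as a hit for k=2 but not for k=1. A larger batch_size makes the search harder because each query has more candidates to rank.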

JOB_TO_EVAL=yt8m_name_simulation
BOARD=yt8m_name_simulation_board
JOB_NAME=yt8m_eval_$(date +%Y%m%d_%H%M%S); gcloud --verbosity=debug ml-engine jobs \
submit training $JOB_NAME \
--package-path=youtube-8m --module-name=youtube-8m.eval \
--staging-bucket=$BUCKET_NAME --region=us-east1 \
--config=youtube-8m/cloudml-gpu.yaml \
-- --eval_data_pattern='gs://youtube8m-ml-us-east1/1/video_level/validate/validate*.tfrecord' \
--hits=1 \
--batch_size=256 \
--max_batches=1 \
--model=EmbeddingModel \
--select_randomly=False \
--feature_names="mean_rgb, mean_audio" \
--feature_sizes="1024, 128" \
--label_loss="CosineAndCrossEntropyLoss" \
--train_dir=$BUCKET_NAME/${JOB_TO_EVAL} \
--run_once=True \
--board_dir=$BUCKET_NAME/${BOARD} \
--hid_1_audio=450 \
--hid_2_audio=250 \
--hid_1_frames=2000 \
--hid_2_frames=700 \
--embedding_size=250

And this is the command used for running it on our server:

JOB_TO_EVAL=yt8m_name_simulation
srun --gres=gpu:1 --mem=5G python eval.py --eval_data_pattern='path_to_validation_data/validate*.tfrecord' \
--model=EmbeddingModel \
--train_dir=$MODEL_DIR/${JOB_TO_EVAL} \
--run_once=True \
--max_batches=1 \
--select_randomly=False \
--feature_names="mean_rgb, mean_audio" \
--feature_sizes="1024, 128" \
--batch_size=1024 \
--image_server=True \
--label_loss="CosineAndCrossEntropyLoss" \
--hits=10 \
--hid_1_audio=450 \
--hid_2_audio=250 \
--hid_1_frames=2000 \
--hid_2_frames=700 \
--embedding_size=250
