BLIP for RSICD image captioning:
The blip-image-captioning-base model has been fine-tuned on the RSICD dataset. Training parameters used are as follows:
- learning_rate = 5e-7
- optimizer = AdamW
- scheduler = ReduceLROnPlateau
- epochs = 5

More details (demo, testing, evaluation, metrics) are available at the GitHub repo.
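
As a rough illustration of how the training parameters listed above fit together in PyTorch, here is a minimal sketch. It is not the author's training script. The tiny linear model, the random data, the `run_epoch` helper, and the scheduler's `factor` and `patience` values are all assumptions made so the example runs without downloading anything. Only the learning rate, optimizer, scheduler type, and epoch count come from this card.

```python
import torch
from torch import nn

# Values stated in the model card.
LEARNING_RATE = 5e-7
EPOCHS = 5

# Stand-in model so the sketch runs offline (assumption: in practice this
# would be the loaded blip-image-captioning-base model).
model = nn.Linear(8, 1)

optimizer = torch.optim.AdamW(model.parameters(), lr=LEARNING_RATE)

# factor and patience are assumed values; the card does not state them.
scheduler = torch.optim.lr_scheduler.ReduceLROnPlateau(
    optimizer, mode="min", factor=0.5, patience=1
)


def run_epoch(model, optimizer):
    """Hypothetical one-batch epoch on random data; returns a validation loss."""
    inputs, targets = torch.randn(16, 8), torch.randn(16, 1)

    # Training step.
    model.train()
    optimizer.zero_grad()
    loss = nn.functional.mse_loss(model(inputs), targets)
    loss.backward()
    optimizer.step()

    # "Validation" pass on the same random batch, for illustration only.
    model.eval()
    with torch.no_grad():
        return nn.functional.mse_loss(model(inputs), targets).item()


lr_history = []
for epoch in range(EPOCHS):
    val_loss = run_epoch(model, optimizer)
    # ReduceLROnPlateau is stepped with the monitored metric, not per batch.
    scheduler.step(val_loss)
    lr_history.append(optimizer.param_groups[0]["lr"])
```

Unlike most schedulers, `ReduceLROnPlateau` takes the monitored metric as the argument to `step()`, so it is called once per epoch after validation. The learning rate only drops when that metric stops improving for more than `patience` epochs.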