NJUST KMG

School of Computer Science and Engineering,
Nanjing University of Science & Technology

Research tools notes

Code

Our Codes are available on our GitHub account njustkmg.

Welcome to use OMML : Multi-Modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval and image caption.

Baidu paddle developers 《Multimodal learning research and toolkit PaddleMM introduction and application》, [BiLibili],
PaddlePaddle Multi-modal learning toolkit based on PaddlePaddle: PaddleMM, Kaggle Contest Book[PaddleMM],
Our mulan-paddlemm , which we partnered with PaddlePaddle, has now joined the Mulan community. [PaddleMM officially entered the Mulan open source community for incubation] [The 11th TOC meeting of the Mulan Open Source Community in 2022] [The 11th TOC meeting of the Mulan Open Source Community in 2022]

Demo

We have developed a website that can identify its authenticity and branding, Ysneaker, the main functions are as follows.

Multimodal Fusion Classification

Cross-modal Style Transfer

Cross-modal Retrieval

-->

We have developed a website for garbage sorting and detecting new categories of Garbage Classification Website, the main functions are as follows.

Detection of new class garbage in open scenarios

We have developed a website for generating descriptions and description enhancements for home improvement images Home Description, , the main functions are as follows.

Enhance the semantic description of home improvement pictures

Optimize the semantic description of home improvement pictures

NJUST KMG

School of Computer Science and Engineering, Nanjing University of Science & Technology

Code

Demo

School of Computer Science and Engineering,
Nanjing University of Science & Technology