Multi-modal clothing recommendation model based on large model and VAE enhancement

Huang, Bingjie, Lu, Qingyi, Huang, Shuaishuai, Wang, Xue-she, Yang, Haowei

Oct-19-2024–arXiv.org Artificial Intelligence

This contrasts with traditional models that process text in a single direction, and it has been widely demonstrated that BERT effectively captures contextual and semantic relationships in text, thereby providing a more comprehensive understanding of context. The embedding components of BERT include word embeddings, segment embeddings, and position embeddings. In essence, word embeddings map each word individually into a vector within a high-dimensional space. The segment embeddings allow BERT to differentiate and process single texts or pairs of texts, thereby enabling the understanding of semantic information at the sentence level. The position embeddings provide sequential information to the structure, allowing the model to mark the position of words in a sentence, which aids in further processing at a higher level. Finally, the CLS token at the beginning of the input sequence represents the final hidden state in the embedding vector, which is commonly used as the representation of the entire input sequence.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

Oct-19-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Texas > Harris County
    - Houston (0.14)
  - Rhode Island > Providence County
    - Providence (0.04)
  - North Carolina > Durham County
    - Durham (0.04)
  - California > Santa Clara County
    - Sunnyvale (0.04)
- Asia > China
  - Anhui Province > Hefei (0.04)

Genre:
- Research Report (0.82)

Industry:
- Health & Medicine > Therapeutic Area (0.54)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Personal Assistant Systems (1.00)
  - Natural Language (1.00)
  - Machine Learning
    - Neural Networks > Deep Learning (0.95)
    - Statistical Learning (0.68)