Review ItemSAGE Paper
Transformer-based, multi-modality recsys from Pinterest
My attempt to keep up with the latest development in the AI/ML world. Research area: NLP, Deep learning for Recommender System, Computer vision
Transformer-based, multi-modality recsys from Pinterest
Proof that Transformer could work with Vision
Meta released a SOTA LMM for the research community
Foundational promptable image segmentation model
Generative model G + discriminative model D
Generative model G + discriminative model D
Advancement for efficient training of Denoising Diffusion models
Parameter-Efficient fine-tuning for LLMs
Basics of contrastive learning
Using momentum for the teacher and balancing between centering and sharpening to avoid mode collapse
Review the Bert paper from Google
Vision’s BERT - Everything is a dog
General-purpose Visual-Natural Language aligned encoder
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Affordances from Human Videos as a Versatile Representation for Robotics
Domain-Specific Batch Norm (DSBN) for Unsupervised Adaptation