🎬 Fine-Tuned Movie Retriever (Rich Semantic & Metadata Queries + Smart Negatives)

Model

This is a custom fine-tuned sentence-transformer model designed for movie and TV recommendation systems. Optimized for high-quality vector retrieval in a movie and TV show recommendation RAG pipeline. Fine-tuning was done using ~32K synthetic natural language queries across metadata and vibe-based prompts:

  • Enriched vibe-style natural language queries (e.g., Emotionally powerful space exploration film with themes of love and sacrifice.)
  • Metadata-based natural language queries (e.g., Any crime movies from the 1990s directed by Quentin Tarantino about heist?)
  • Smarter negative sampling (genre contrast, theme mismatch, star-topic confusion)
  • A dataset of over 32,000 triplets (query, positive doc, negative doc)

🧠 Training Details

  • Base model: BAAI/bge-base-en-v1.5
  • Loss function: MultipleNegativesRankingLoss
  • Epochs: 4
  • Optimized for: top-k semantic retrieval in RAG systems

πŸ“ˆ Evaluation: Fine-tuned vs Base Model

Metric Fine-Tuned Model Score Base Model Score
Recall@1 0.456 0.214
Recall@3 0.693 0.361
Recall@5 0.758 0.422
Recall@10 0.836 0.500
MRR 0.595 0.315

Evaluation setup:

  • Dataset: 3,598 held-out metadata and vibe-style natural queries
  • Method: Top-k ranking using cosine similarity between query and positive documents
  • Goal: Assess top-k retrieval quality in recommendation-like settings

πŸ“¦ Usage

from sentence_transformers import SentenceTransformer

model = SentenceTransformer("jjtsao/fine-tuned_movie_retriever-bge-base-en-v1.5")
query_embedding = model.encode("mind-bending sci-fi thrillers from the 2000s about identity")

πŸ” Ideal Use Cases

  • RAG-style movie recommendation apps
  • Semantic filtering of large movie catalogs
  • Query-document reranking pipelines

πŸ“œ License

Apache 2.0 β€” open for personal and commercial use.

Downloads last month
3
Safetensors
Model size
109M params
Tensor type
F16
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for JJTsao/fine-tuned_movie_retriever-bge-base-en-v1.5-fp16

Finetuned
(426)
this model

Evaluation results