'prioritize a specific feature while embedding for recommendation system

I am trying to build a content based movie recommendation system. Say I am using fields are, Director, Crew, Genre. Procedure as follows,

  1. Concatenate all the features as 'Director' + ' '+ 'Crew' + ' '+ 'Genre'.
  2. Count Vectorizer of this concatenated feature.
  3. Calculate similarity score
  4. Order then according to the similarity score.

Here all the features have same priority.

Now I want to prioritize the 'Director'. Say I want the recommend movies tends to be have same Director.



Sources

This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.

Source: Stack Overflow

Solution Source