Resume Recommender

You probably already know transformers have a maximum sequence length, and that by default anything beyond this limit is truncated and never reaches the model. But what happens if you can’t afford to truncate the inputs? You’re probably looking at a sliding window approach, where you split the text into windows that each fit within the limit and process each in turn.
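To make the sliding window idea concrete, here is a minimal sketch that windows over whitespace-separated words. The function name `sliding_windows` and the `window_size` and `stride` defaults are my own assumptions for illustration. Word counts are only a rough stand-in for the model's real token count, so in practice you would leave some headroom below the actual limit.

```python
def sliding_windows(text, window_size=128, stride=96):
    """Split text into overlapping windows of whitespace-separated words.

    window_size and stride are counted in words, not model tokens
    (an assumption made to keep this sketch dependency-free).
    """
    if window_size <= 0 or not 0 < stride <= window_size:
        raise ValueError("need window_size > 0 and 0 < stride <= window_size")

    words = text.split()
    windows = []
    for start in range(0, len(words), stride):
        windows.append(" ".join(words[start:start + window_size]))
        # Stop once a window has reached the end of the text,
        # so we don't emit a trailing window that is pure overlap.
        if start + window_size >= len(words):
            break
    return windows
```

A stride smaller than the window size makes consecutive windows overlap, which helps when the relevant passage straddles a boundary. Each window can then be processed on its own and the results combined however suits the task.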

Normally I just split on whitespace, but I’ve been interested in trying out NLTK’s sentence tokenizer. Recommending resumes for a free-form text query is as good an excuse as any. 🙂
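As a rough sketch of what sentence-based windows could look like, the helper below greedily packs whole sentences into chunks under a word budget. The name `pack_sentences` and the `max_words` default are hypothetical, and the function takes an already-split list of sentences so it runs without any extra downloads.

```python
def pack_sentences(sentences, max_words=128):
    """Greedily group whole sentences into chunks of at most max_words words.

    A single sentence longer than max_words still becomes its own
    (over-budget) chunk; this sketch does not split inside sentences.
    """
    chunks, current, count = [], [], 0
    for sentence in sentences:
        n = len(sentence.split())
        # Flush the current chunk if adding this sentence would exceed the budget.
        if current and count + n > max_words:
            chunks.append(" ".join(current))
            current, count = [], 0
        current.append(sentence)
        count += n
    if current:
        chunks.append(" ".join(current))
    return chunks
```

With NLTK installed and its Punkt tokenizer data downloaded, the input list could come from `nltk.tokenize.sent_tokenize(text)`. The appeal over plain whitespace splitting is that no window starts or ends mid-sentence.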