FicSim: An Ethically Constructed Dataset for Long-Context Semantic Similarity Comparison within Fiction

N Johnson, A Bertsch, E Strubell - creativity-ai.github.io
As language models continue to advance in their ability to process long and complex texts,
there has been growing interest in their application within computational literary studies …