gpt-2-output-dataset


  • This dataset contains:
      - 250K documents from the WebText test set
      - For each GPT-2 model (trained on the WebText training set), 250K random samples (temperature 1, no truncation) and 250K samples generated with Top-K 40 truncation
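
To make the difference between the two sampling settings concrete, here is a minimal sketch of one decoding step. It is an illustration only, not the code used to produce the dataset: the function name `sample_token` and its parameters are hypothetical, and it works on a plain list of logits rather than on a real GPT-2 model.

```python
import math
import random


def sample_token(logits, temperature=1.0, top_k=None, rng=random):
    """Sample one token index from unnormalized logits (hypothetical helper).

    top_k=None means no truncation: every token can be drawn.
    top_k=40 restricts the draw to the 40 highest-scoring tokens.
    """
    # Temperature 1 leaves the logits unchanged.
    scaled = [x / temperature for x in logits]
    indices = list(range(len(scaled)))
    if top_k is not None and top_k < len(scaled):
        # Keep only the k highest-scoring token indices.
        indices = sorted(indices, key=lambda i: scaled[i], reverse=True)[:top_k]
    # Softmax weights over the surviving tokens (max subtracted for stability).
    peak = max(scaled[i] for i in indices)
    weights = [math.exp(scaled[i] - peak) for i in indices]
    return rng.choices(indices, weights=weights, k=1)[0]
```

Calling `sample_token(logits)` corresponds to the "temperature 1, no truncation" setting, while `sample_token(logits, top_k=40)` corresponds to Top-K 40 truncation. In the truncated case, low-probability tokens outside the top 40 can never be selected.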