Reader-LM: Small Language Models for Cleaning and Converting HTML to Markdown


Published Sep 12 '24. Last edited Feb 11 '25

Fact  

I tried using the model to extract the full text of a product release in a Google Colab notebook. It extracted only about 20% of the full text and then stopped. Here is the notebook: reader_lm_tutorial.ipynb
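For anyone who wants to put a number on "about 20%", here is a minimal sketch that estimates how much of the source text made it into the output. It is not from the notebook: the names `words` and `coverage_ratio` are hypothetical helpers, and it assumes you already have the page's plain text and the model's output as two Python strings. It only uses the standard library.

```python
import re
from difflib import SequenceMatcher


def words(text: str) -> list[str]:
    # Lowercase word tokens; punctuation and Markdown symbols are ignored.
    return re.findall(r"\w+", text.lower())


def coverage_ratio(source_text: str, extracted_text: str) -> float:
    # Fraction of source words that also appear, in order, in the extraction.
    src = words(source_text)
    out = words(extracted_text)
    if not src:
        return 0.0
    matcher = SequenceMatcher(None, src, out, autojunk=False)
    matched = sum(block.size for block in matcher.get_matching_blocks())
    return matched / len(src)


# Toy example: the "extraction" stops after the first 2 of 10 words.
source = "one two three four five six seven eight nine ten"
extracted = "one two"
print(coverage_ratio(source, extracted))  # 0.2
```

A ratio far below 1.0 on a real page would be consistent with the output being cut short, though this check alone cannot say why the model stopped.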


 
