
After building the SaguaroChem dataset by extracting chemical reaction data from the patent literature, we stepped back to evaluate how we could improve our extraction pipeline, specifically the coverage, quality, and format of the data we extract from chemical synthesis documents. With a solid foundation in patents, we set our sights on a new and