In this RECOVER study, researchers wanted to update a smart computer program, called a machine learning pipeline, to better identify people with Long COVID. In 2021, the first version of the program, called LCM 1, was created to identify people with or likely to have Long COVID. LCM 1 depended on people having a COVID-19 diagnosis date in their electronic health records (EHR). This meant LCM 1 could miss people who may have taken a COVID-19 test at home. LCM 1 also did not look at information about whether people got COVID-19 more than once. To improve the program and create a new version called LCM 2, researchers used more than 5 million EHRs from a large set of data called the National COVID Cohort Collaborative (N3C). They taught the program to look at a person’s health information over many years, not just starting from their first recorded COVID-19 diagnosis. Researchers found that LCM 2 was very accurate. They used it to estimate that about 1 in 10 people in the database who had COVID-19 went on to develop Long COVID. This study is important because it shows that older machine learning models, like LCM 1, can be updated to keep up with the way an illness is tracked and diagnosed over time. This can help other researchers improve their machine learning models to produce more accurate findings.