Detailed information for repository usage.
Open sync_nlx
and set the variables throughout the file correctly.
Run the script to scrap the database,
pipenv run sync_nlx.py
The original code compresses and pushes the flat files to S3, this version does not. It can be easily copy and pasted to add that functionality by going to the original-code
branch.
Compress your flat file (gz preferably) and send it wherever you need to.
If instead you want to create the schema.org files continue to the next step without compressing.
Set the correct variables in nlx.py
Run the application
pipenv run nlx.py
Some organizations may have trouble opening the csv files produced.
In that case you can try ASCII encoding the file and setting the row delimieter to CRLF with the following commands,
cat 2018.csv | iconv -c -t ASCII//IGNORE > 2018_ascii_dos.csv
unix2dos 2018_ascii_dos.csv