TODO
For the graph implementation specifically, you need to install GraphFrames manually from a third party, since the official release is incompatible with Spark 3.x (pull request pending). A prebuilt copy is supplied in the spark-packages directory.
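As a rough illustration of how a prebuilt jar could be handed to Spark, the sketch below builds a spark-submit argument list that ships every jar found in a packages directory via the `--jars` option. Whether submit_graph.sh works this way is an assumption, and the helper name `build_submit_command` as well as the script path in the usage note are hypothetical.

```python
from pathlib import Path


def build_submit_command(script, packages_dir="spark-packages"):
    """Return a spark-submit argument list that ships every jar in packages_dir.

    Hypothetical helper for illustration; the repository's own submit
    scripts may assemble the command differently.
    """
    # Collect jar paths in a stable order so the command is reproducible.
    jars = sorted(str(p) for p in Path(packages_dir).glob("*.jar"))
    command = ["spark-submit"]
    if jars:
        # --jars expects a single comma-separated list of jar paths.
        command += ["--jars", ",".join(jars)]
    command.append(script)
    return command
```

The returned list could be passed to `subprocess.run`, for example `subprocess.run(build_submit_command("path/to/graph_job.py"), check=True)`, where the script path is a placeholder.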
1. Edit settings.json to reflect your setup. If you are running everything locally, you can use start_services.sh to start all services at once. Cassandra may take a few minutes to become available.
2. Run python3 setup.py from the project root. By default this loads small_test_data.csv into the transactions table.
3. Run the job with submit.sh (slow) or submit_graph.sh (faster).
4. To clean up, run python3 clean.py. Be aware that this wipes all table definitions and data.
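Because Cassandra can take a few minutes to come up, it may help to wait for it before running setup.py. The sketch below polls a TCP port until a connection succeeds. It assumes Cassandra listens on its default CQL port 9042 on localhost; your settings.json may point elsewhere. The function name `wait_for_port` is hypothetical, and an open port is only a rough signal that the service is ready to accept queries.

```python
import socket
import time


def wait_for_port(host="127.0.0.1", port=9042, timeout=300.0, interval=5.0):
    """Poll until a TCP connection to host:port succeeds or timeout elapses.

    Returns True once a connection is accepted, False if the deadline
    passes first. Host and port defaults are assumptions, not values
    taken from this project's settings.json.
    """
    deadline = time.monotonic() + timeout
    while True:
        try:
            # A successful connect means something is listening on the port.
            with socket.create_connection((host, port), timeout=interval):
                return True
        except OSError:
            # Connection refused or timed out: retry until the deadline.
            if time.monotonic() >= deadline:
                return False
            time.sleep(interval)
```

A possible use is to call `wait_for_port()` right after start_services.sh and only continue with `python3 setup.py` if it returns True.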