Spark cluster optimization configuration |
iServer's distributed analysis service is based on the Spark computing platform, providing GIS distributed analysis and processing capabilities. Different hardware environments, Spark cluster environments, and analysis data of different size all affect the performance of distributed analysis. To achieve the best performance, you need to optimize the configuration according to different scenarios. Here are some commonly used optimization methods:
Enter the conf directory of the Spark installation package, such as [iServer installation directory]\support\spark\conf, rename the spark-defaults.conf.template file to spark-defaults.conf
Open spark-defaults.conf with document editor, add: spark.executor.memory 8g
Modify the spark driver memory
Enter the conf directory of the Spark installation package, such as [iServer installation directory]\support\spark\conf, rename the spark-defaults.conf.template file to spark-defaults.conf
Open spark-defaults.conf with document editor, add: spark.driver.memory 5g
If you are using the built-in spark in iServer, you can set the Spark master node not to participate in the analysis task
Click "Cluster", "Join cluster" in the iServer where the spark master node is located, cancel the cluster reporter in Whether to be the Distributed Analysis node.
Enter the conf directory of the Spark installation package, such as [iServer installation directory]\support\spark\conf, rename the spark-defaults.conf.template file to spark-defaults.conf
Open spark-defaults.conf with document editor, add: spark.scheduler.mode FIFO