Page 1 of 1

Issues importing large amounts of records in batch job

Posted: August 16th, 2022, 10:38 pm
by CathyC
I was talking with Matt and we both had situations where we needed to import many records and ran into design issues in the batch job.
I know the recommended approach is to save to db (described in this forum post). Below is the summary of the approaches tried and how they turned out
Ideally all approaches should have some way to work, though approach 1 and 3 would not be recommended for data that could change.
Can I get some feedback on a setup for approaches 1 and 2 that would not run into these issues?

Approach #1: RunQuery Paging
Using the paging feature for RunQuery by setting Paging = True, # of records per page = 50
Then set up the canvas so it looped back to RunQuery until EOF = True
This approach meant that it would have multiple queries against JDE
Failed: hung at ~30k records and caused the whole batch server to hang

Approach #2: Save to Db
Using the save to db feature of a RunQuery and retrieving using the CreateServiceNodeFromDatabase.

Platform3 App1: pulling 50 records at a time, saving to database, ECC timeout 2400000 (40min), Data Size 9.77 KB.
Tried to pull in over 10k records and it would timeout. Eventually the requirements were changed and more filters were added so only 2k records were needed, so this was never solved.

Platform3 App2: same variables. 50 records at a time, saving to database, ECC timeout 240000, data size 9.77 KB.
Tried to pull in 16k records, but get an ECC timeout message:
An error occurred in the ItemQuery RunQuery service: No message received from ECC for application : 100093 (Connector Application : 100053). Timeout: 2400000 ms

Attempt 3: 50 records at a time, data size: 1MB, 10 minute timeout.
Needed to pull in 191k records, but lowered it to 10k and it still timed out.

Approach #3: Run Batch Job at regular frequency and resume progress
run for 1k at a time and let the batch job run every minute and resume at the last ID
keeping track of the last ID that was processed and doing a ID > lastId in the Query with a top 1000
This worked

Re: Issues importing large amounts of records in batch job

Posted: August 18th, 2022, 11:33 am
by CameronDouthitt
Recent development from Platform2 on 5.1 classic:

They were currently using the approach #1 for their AddressImport and TopSellingItems batch jobs.

The issue found was that the query that was being looped on never reach EOF. So the batch job would run indefinitely until the server ran out of resources. Especially because both of these would run at the same time each day.

Solution:
I had to change these over to use approach #2. This is current best practice.

Conclusion:
1. When looping over any set of data that has the potential to get large in a batch job, approach #2 should be the method used.
2. Make sure to set the ECC timeout(if JDE through ECC) to a high enough setting so the connection to JDE stays open long enough to read all data from the query in one go.
3. Make sure to set the data size high enough that it can receive all data that is coming from the query.

Re: Issues importing large amounts of records in batch job

Posted: August 18th, 2022, 11:46 am
by CameronDouthitt
Successful uses from Platform2 update:

These successful batch jobs were done using approach #2.

Job 1:
- Loop1: pulling 25 records at a time from SaveToDB. Actual count 671. ECC timeout(none-application query). Data Size 97.66KB .
Job 2:
- Loop1: pulling 50 records at a time from SaveToDB. Actual count ~2000. ECC timeout(600000). Data Size 9.54 MB
- Loop2: pulling 50 records at a time from SaveToDB. Actual count ~1500. ECC timeout(600000). Data Size 976.56 KB

All of these loop setups are working successfully to resolve the initial issues found when using approach #1.

Re: Issues importing large amounts of records in batch job

Posted: August 19th, 2022, 6:40 am
by SumanPrasad
Task 10697 : Solve ECC timeout issue
Task 10698 : Solve Batch job loop issues

Re: Issues importing large amounts of records in batch job

Posted: September 16th, 2022, 2:48 pm
by JustinVanRegenmorter
Task 10698 has been resolved in RunTime Revision 1065.

Re: Issues importing large amounts of records in batch job

Posted: October 7th, 2022, 2:27 pm
by darshanP
Had an issue with the Item import query. I was seeing an error - 'Unable to read data from the transport connection.'
I changed the inquire service datasize to 512000 from 10000 and the timeout is set to 2400000. That fixed the error and was able to get 16220 records.