
quickstart fail ingest error during install

Hi,

I have tried a few things and believe my machine is configured as specified in the quickstart guide, but I am running into a few issues. The install completes with “success”, but once the web test starts, Wildfly comes up and then I am stuck with the following until timeout:

[DW-INFO] - Polling for EAR deployment status every 4 seconds (15 attempts max)
    -- Wildfly process not found (1/15)
    +- Wildfly up (132683). EAR deployment pending (2/15)

CentOS 7, 64 cores, 64 GB RAM

$ java -version
openjdk version "1.8.0_262"
OpenJDK Runtime Environment (build 1.8.0_262-b10)
OpenJDK 64-Bit Server VM (build 25.262-b10, mixed mode)

$ ulimit -u
32768

$ ulimit -n
32768

$ sudo sysctl vm.swappiness=0

$ git clone https://github.com/NationalSecurityAgency/datawave.git
$ echo "source my_path/contrib/datawave-quickstart/bin/env.sh" >> ~/.bashrc
$ source ~/.bashrc
$ allInstall


Potentially relevant error:

Exception in thread "main" java.io.FileNotFoundException: File does not exist: hdfs://localhost:9000/datawave/ingest/work/jobCacheB/datawave-ws-common-util-4.0.0-SNAPSHOT.jar
    at org.apache.hadoop.hdfs.DistributedFileSystem$29.doCall(DistributedFileSystem.java:1587)
    at org.apache.hadoop.hdfs.DistributedFileSystem$29.doCall(DistributedFileSystem.java:1580)
    at org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)
    at org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:1595)
    at org.apache.hadoop.mapreduce.filecache.ClientDistributedCacheManager.getFileStatus(ClientDistributedCacheManager.java:325)
    at org.apache.hadoop.mapreduce.filecache.ClientDistributedCacheManager.getFileStatus(ClientDistributedCacheManager.java:236)
    at org.apache.hadoop.mapreduce.filecache.ClientDistributedCacheManager.determineTimestamps(ClientDistributedCacheManager.java:111)
    at org.apache.hadoop.mapreduce.filecache.ClientDistributedCacheManager.determineTimestampsAndCacheVisibilities(ClientDistributedCacheManager.java:69)
    at org.apache.hadoop.mapreduce.JobResourceUploader.uploadResourcesInternal(JobResourceUploader.java:220)
    at org.apache.hadoop.mapreduce.JobResourceUploader.uploadResources(JobResourceUploader.java:133)
    at org.apache.hadoop.mapreduce.JobSubmitter.copyAndConfigureFiles(JobSubmitter.java:99)
    at org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:194)
    at org.apache.hadoop.mapreduce.Job$11.run(Job.java:1570)
    at org.apache.hadoop.mapreduce.Job$11.run(Job.java:1567)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:422)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1729)
    at org.apache.hadoop.mapreduce.Job.submit(Job.java:1567)
    at datawave.ingest.mapreduce.job.IngestJob.run(IngestJob.java:377)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:76)
    at datawave.ingest.mapreduce.job.IngestJob.main(IngestJob.java:209)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:498)
    at org.apache.hadoop.util.RunJar.run(RunJar.java:318)
    at org.apache.hadoop.util.RunJar.main(RunJar.java:232)
[DW-WARN] - The IngestJob class encountered errors (exit status: 1). See job log above for details
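
To narrow this down, one option is to pull the missing path out of the job log and ask HDFS directly whether it exists. The sketch below is only a diagnostic aid, not part of the quickstart: the helper names (missing_hdfs_path, exists_command, path_exists) are made up for illustration, and it assumes the hdfs CLI from the quickstart's Hadoop install is on the PATH. The only Hadoop command it relies on is `hdfs dfs -test -e <path>`, which exits with status 0 when the path exists.

```python
import re
import shutil
import subprocess

# Matches the message format seen in the IngestJob log above.
MISSING = re.compile(r"FileNotFoundException: File does not exist: (hdfs://\S+)")


def missing_hdfs_path(log_text):
    """Return the first missing hdfs:// path reported in log_text, or None."""
    match = MISSING.search(log_text)
    return match.group(1) if match else None


def exists_command(path):
    """Build the hdfs CLI invocation that tests whether path exists."""
    return ["hdfs", "dfs", "-test", "-e", path]


def path_exists(path):
    """Return True/False from HDFS, or None if the hdfs CLI is not on PATH."""
    if shutil.which("hdfs") is None:
        return None
    return subprocess.run(exists_command(path)).returncode == 0
```

If path_exists returns False for the jar named in the exception, the job cache under /datawave/ingest/work was probably never populated, which would point at an earlier install step rather than at the ingest job itself.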

Post install:

====== Hadoop Status ======
pids: 103749 104126 104760 105121 105737
[DW-INFO] - NodeManager => 105121
[DW-INFO] - NameNode => 103749
[DW-INFO] - JobHistoryServer => 105737
[DW-INFO] - ResourceManager => 104760
[DW-INFO] - DataNode => 104126
[DW-WARN] - SecondaryNameNode is not running
====== Accumulo Status ======
pids: 106905 107002 107283 107361 107415
[DW-INFO] - gc => 107361
[DW-INFO] - master => 107283
[DW-INFO] - tracer => 107415
[DW-INFO] - monitor => 106905
[DW-INFO] - tserver => 107002
====== ZooKeeper Status ======
[DW-INFO] - ZooKeeper => 106797
====== DataWave Ingest Status ======
[DW-INFO] - No ingest processes are running
====== DataWave Web Status ======
[DW-WARN] - Wildfly is not running

Issue Analytics

  • State: open
  • Created 3 years ago
  • Comments: 9

Top GitHub Comments

abazabaaa commented, Mar 21, 2021

This might have already been dealt with under https://github.com/NationalSecurityAgency/datawave/issues/672

Trying this…

abazabaaa commented, Mar 21, 2021

For step 4 on: https://code.nsa.gov/datawave/docs/2.9/tour/ingest-basics#step-1-define-the-data-type

$ ./ingest-tv-shows.sh --download-only --outfile ~/more-tv-shows.json
[DW-INFO] - Writing json records to /home/schrogpu/more-tv-shows.json
[DW-INFO] - Downloading show data: 'Veep'
  File "<string>", line 1
    import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True)
    ^
SyntaxError: invalid syntax
[DW-INFO] - Downloading show data: 'Game of Thrones'
  File "<string>", line 1
    import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True)
    ^
SyntaxError: invalid syntax
[DW-INFO] - Downloading show data: 'I Love Lucy'
  File "<string>", line 1
    import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True)
    ^
SyntaxError: invalid syntax
[DW-INFO] - Downloading show data: 'Breaking Bad'
  File "<string>", line 1
    import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True)
    ^
SyntaxError: invalid syntax
[DW-INFO] - Downloading show data: 'Malcom in the Middle'
  File "<string>", line 1
    import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True)
    ^
SyntaxError: invalid syntax
[DW-INFO] - Downloading show data: 'The Simpsons'
  File "<string>", line 1
    import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True)
    ^
SyntaxError: invalid syntax
[DW-INFO] - Downloading show data: 'Sneaky Pete'
  File "<string>", line 1
    import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True)
    ^
SyntaxError: invalid syntax
[DW-INFO] - Downloading show data: 'King of the Hill'
  File "<string>", line 1
    import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True)
    ^
SyntaxError: invalid syntax
[DW-INFO] - Downloading show data: 'Three's Company'
  File "<string>", line 1
    import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True)
    ^
SyntaxError: invalid syntax
[DW-INFO] - Downloading show data: 'The Andy Griffith Show'
  File "<string>", line 1
    import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True)
    ^
SyntaxError: invalid syntax
[DW-INFO] - Downloading show data: 'Matlock'
  File "<string>", line 1
    import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True)
    ^
SyntaxError: invalid syntax
[DW-INFO] - Downloading show data: 'North and South'
  File "<string>", line 1
    import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True)
    ^
SyntaxError: invalid syntax
[DW-INFO] - Downloading show data: 'MASH'
  File "<string>", line 1
    import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True)
    ^
SyntaxError: invalid syntax
[DW-INFO] - Data download is complete

I think the issue here is that the inline script uses the Python 2 print statement, which is a syntax error when the script runs under Python 3?
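
For what it's worth, the same pretty-printing step can be written so that it runs on Python 3 by calling print as a function. The sketch below is my own rewrite of the one-liner from the error output, not the actual fix in the DataWave repository; the helper name pretty and the sample record are made up for illustration.

```python
import json


def pretty(raw):
    """Re-serialize JSON text with a 2-space indent and sorted keys."""
    data = json.loads(raw)
    return json.dumps(data, indent=2, sort_keys=True)


# Hypothetical record standing in for one downloaded show.
sample = '{"name": "Veep", "id": 1}'

# print(...) with parentheses is valid on Python 3, unlike "print json.dumps(...)".
print(pretty(sample))
```

Applied to the script's inline command, the equivalent change would be wrapping the json.dumps(...) call in print(...), assuming nothing else in the script depends on Python 2.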
