quickstart fail ingest error during install
See original GitHub issueHi,
I have tried a few things and feel I have configured my machine in a way that reflects what is specified in the quickstart guide, but I am running into a few issues. I can complete the install with “success” but once the web test starts, wildfly goes up and then I am stuck with the following until timeout:
[DW-INFO] - Polling for EAR deployment status every 4 seconds (15 attempts max) – Wildfly process not found (1/15) ± Wildfly up (132683). EAR deployment pending (2/15)
Centos 7 64cores, 64GB ram
$ java -version
openjdk version "1.8.0_262"
OpenJDK Runtime Environment (build 1.8.0_262-b10)
OpenJDK 64-Bit Server VM (build 25.262-b10, mixed mode)
$ ulimit -u
32768
$ ulimit -n
32768
$ sudo sysctl vm.swappiness=0
$ git clone https://github.com/NationalSecurityAgency/datawave.git
$ echo "source my_path/contrib/datawave-quickstart/bin/env.sh" >> ~/.bashrc
$ source ~/.bashrc
$ allInstall
Potentially relevant error: Exception in thread “main” java.io.FileNotFoundException: File does not exist: hdfs://localhost:9000/datawave/ingest/work/jobCacheB/datawave-ws-common-util-4.0.0-SNAPSHOT.jar at org.apache.hadoop.hdfs.DistributedFileSystem$29.doCall(DistributedFileSystem.java:1587) at org.apache.hadoop.hdfs.DistributedFileSystem$29.doCall(DistributedFileSystem.java:1580) at org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81) at org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:1595) at org.apache.hadoop.mapreduce.filecache.ClientDistributedCacheManager.getFileStatus(ClientDistributedCacheManager.java:325) at org.apache.hadoop.mapreduce.filecache.ClientDistributedCacheManager.getFileStatus(ClientDistributedCacheManager.java:236) at org.apache.hadoop.mapreduce.filecache.ClientDistributedCacheManager.determineTimestamps(ClientDistributedCacheManager.java:111) at org.apache.hadoop.mapreduce.filecache.ClientDistributedCacheManager.determineTimestampsAndCacheVisibilities(ClientDistributedCacheManager.java:69) at org.apache.hadoop.mapreduce.JobResourceUploader.uploadResourcesInternal(JobResourceUploader.java:220) at org.apache.hadoop.mapreduce.JobResourceUploader.uploadResources(JobResourceUploader.java:133) at org.apache.hadoop.mapreduce.JobSubmitter.copyAndConfigureFiles(JobSubmitter.java:99) at org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:194) at org.apache.hadoop.mapreduce.Job$11.run(Job.java:1570) at org.apache.hadoop.mapreduce.Job$11.run(Job.java:1567) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:422) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1729) at org.apache.hadoop.mapreduce.Job.submit(Job.java:1567) at datawave.ingest.mapreduce.job.IngestJob.run(IngestJob.java:377) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:76) at datawave.ingest.mapreduce.job.IngestJob.main(IngestJob.java:209) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:498) at org.apache.hadoop.util.RunJar.run(RunJar.java:318) at org.apache.hadoop.util.RunJar.main(RunJar.java:232) [DW-WARN] - The IngestJob class encountered errors (exit status: 1). See job log above for details
Post install:
====== Hadoop Status ====== pids: 103749 104126 104760 105121 105737 [DW-INFO] - NodeManager => 105121 [DW-INFO] - NameNode => 103749 [DW-INFO] - JobHistoryServer => 105737 [DW-INFO] - ResourceManager => 104760 [DW-INFO] - DataNode => 104126 [DW-WARN] - SecondaryNameNode is not running ====== Accumulo Status ====== pids: 106905 107002 107283 107361 107415 [DW-INFO] - gc => 107361 [DW-INFO] - master => 107283 [DW-INFO] - tracer => 107415 [DW-INFO] - monitor => 106905 [DW-INFO] - tserver => 107002 ====== ZooKeeper Status ====== [DW-INFO] - ZooKeeper => 106797 ====== DataWave Ingest Status ====== [DW-INFO] - No ingest processes are running ====== DataWave Web Status ====== [DW-WARN] - Wildfly is not running
Issue Analytics
- State:
- Created 3 years ago
- Comments:9
This might have already been dealt with under https://github.com/NationalSecurityAgency/datawave/issues/672
Trying this…
For step 4 on: https://code.nsa.gov/datawave/docs/2.9/tour/ingest-basics#step-1-define-the-data-type
$ ./ingest-tv-shows.sh --download-only --outfile ~/more-tv-shows.json [DW-INFO] - Writing json records to /home/schrogpu/more-tv-shows.json [DW-INFO] - Downloading show data: ‘Veep’ File “<string>”, line 1 import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True) ^ SyntaxError: invalid syntax [DW-INFO] - Downloading show data: ‘Game of Thrones’ File “<string>”, line 1 import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True) ^ SyntaxError: invalid syntax [DW-INFO] - Downloading show data: ‘I Love Lucy’ File “<string>”, line 1 import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True) ^ SyntaxError: invalid syntax [DW-INFO] - Downloading show data: ‘Breaking Bad’ File “<string>”, line 1 import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True) ^ SyntaxError: invalid syntax [DW-INFO] - Downloading show data: ‘Malcom in the Middle’ File “<string>”, line 1 import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True) ^ SyntaxError: invalid syntax [DW-INFO] - Downloading show data: ‘The Simpsons’ File “<string>”, line 1 import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True) ^ SyntaxError: invalid syntax [DW-INFO] - Downloading show data: ‘Sneaky Pete’ File “<string>”, line 1 import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True) ^ SyntaxError: invalid syntax [DW-INFO] - Downloading show data: ‘King of the Hill’ File “<string>”, line 1 import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True) ^ SyntaxError: invalid syntax [DW-INFO] - Downloading show data: ‘Three’s Company’ File “<string>”, line 1 import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True) ^ SyntaxError: invalid syntax [DW-INFO] - Downloading show data: ‘The Andy Griffith Show’ File “<string>”, line 1 import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True) ^ SyntaxError: invalid syntax [DW-INFO] - Downloading show data: ‘Matlock’ File “<string>”, line 1 import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True) ^ SyntaxError: invalid syntax [DW-INFO] - Downloading show data: ‘North and South’ File “<string>”, line 1 import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True) ^ SyntaxError: invalid syntax [DW-INFO] - Downloading show data: ‘MASH’ File “<string>”, line 1 import sys,json;data=json.loads(sys.stdin.read()); print json.dumps(data, indent=2, sort_keys=True) ^ SyntaxError: invalid syntax [DW-INFO] - Data download is complete
I think the issue here is that the print command is written with python 2 in mind?