pvserver disconnecting

Hello

I am trying to access my simulation data in the cluster using the pvserver and the connection becomes unstable after I select the nek5000 field file to open and displays the message below.
Can you explain what is happening and the rectification?


image

Thank you
Pranay

Please share your nek5000 file.

this file links to the field files for visualization. This is just a binder file.
field0.nek5000 (92 Bytes)

I confirm that ParaView is unable to read this file and crashes, please open an issue: https://gitlab.kitware.com/paraview/paraview/-/issues

I can open this file using the normal local paraview, just that the pvserver version has this issue. I will open an issue.
PS: the paraview version I am using is 5.12.0

Thank you

Hi!
when I opened the nek5000 file in a notepad, it looked like a metadata file that points to other files. I don’t know if I am wrong but without the actual linked data, I was wondering how ParaView/pvserver can open it. When I opened the file in local ParaView, obviously I get the same error you are getting. Since you have not shared the nek5000 linked files with Mathieu, I wish to ask… did you miss to copy the nek5000 linked files to the cluster as well?

This thread also reminds me of the [*.mhd and .raw file] pair, where .mhd is a metadata pointing to a .raw data. Failing to provide the raw data in the same folder while loading the .mhd file is a problem, but there I usually get an error instead of a crash. :slight_smile:

1 Like

Yes, as I mentioned before this is a binder file that connects other field files which has the timestamp data. I just posted the binder file and my cluster had the linked data. Each file has around 2.3 GB of data and hence I couldn’t post the data here.

Just to add to that, I think there is a problem in the cluster version of paraview as when I try to open it directly by loading paraview in the cluster I get the following error:

[axje9953@hpcl001 p_6]$ paraview
Error parsing file.  Unknown tag type:
[hpcl001:3162786:0:3162786] Caught signal 11 (Segmentation fault: address not mapped to object at address (nil))
==== backtrace (tid:3162786) ====
 0 0x000000000004ead0 killpg()  ???:0
 1 0x00000000000dc47b QThreadStorageData::get()  ???:0
 2 0x00000000001789f4 QOpenGLContextPrivate::setCurrentContext()  ???:0
 3 0x0000000000179e79 QOpenGLContext::makeCurrent()  ???:0
 4 0x00000000000353b1 vtkObject::vtkClassMemberCallback<QVTKRenderWindowAdapter::QVTKInternals>::operator()()  :0
 5 0x00000000010a8f5b vtkSubjectHelper::InvokeEvent()  :0
 6 0x000000000021e7b5 vtkViewport::RemoveViewProp()  ???:0
 7 0x000000000010c2db vtkAbstractWidget::~vtkAbstractWidget()  ???:0
 8 0x000000000015dc83 vtkCameraOrientationWidget::~vtkCameraOrientationWidget()  :0
 9 0x00000000002128e7 vtkPVRenderView::~vtkPVRenderView()  ???:0
10 0x0000000000212949 vtkPVRenderView::~vtkPVRenderView()  ???:0
11 0x00000000012ba73e vtkSmartPointerBase::~vtkSmartPointerBase()  ???:0
12 0x00000000012ba7fe vtkSmartPointerBase::operator=()  ???:0
13 0x0000000000106d69 vtkSIProxy::DeleteVTKObjects()  vtkSIProxy.cxx:0
14 0x0000000000107c50 vtkSIProxy::~vtkSIProxy()  ???:0
15 0x0000000000107f29 vtkSIProxy::~vtkSIProxy()  ???:0
16 0x00000000000e9785 vtkPVSessionCore::~vtkPVSessionCore()  ???:0
17 0x00000000000e9969 vtkPVSessionCore::~vtkPVSessionCore()  ???:0
18 0x00000000000e5c58 vtkPVSessionBase::~vtkPVSessionBase()  ???:0
19 0x000000000020dc79 vtkSMSession::~vtkSMSession()  ???:0
20 0x00000000012ba73e vtkSmartPointerBase::~vtkSmartPointerBase()  ???:0
21 0x0000000000047c1e vtkProcessModule::~vtkProcessModule()  ???:0
22 0x0000000000047ca9 vtkProcessModule::~vtkProcessModule()  ???:0
23 0x00000000012ba73e vtkSmartPointerBase::~vtkSmartPointerBase()  ???:0
24 0x0000000000051717 __cxa_finalize()  ???:0
25 0x000000000003c6f7 __do_global_dtors_aux()  crtstuff.c:0
=================================
[hpcl001:3162786] *** Process received signal ***
[hpcl001:3162786] Signal: Segmentation fault (11)
[hpcl001:3162786] Signal code:  (-6)
[hpcl001:3162786] Failing at address: 0x6877003042a2
[hpcl001:3162786] [ 0] /lib64/libc.so.6(+0x4ead0)[0x155554fafad0]
[hpcl001:3162786] [ 1] /cm/shared/uniol/sw/zen4/13.1/Qt5/5.15.10-GCCcore-13.1.0/lib/libQt5Core.so.5(_ZNK18QThreadStorageData3getEv+0x2b)[0x15554de7547b]
[hpcl001:3162786] [ 2] /cm/shared/uniol/sw/zen4/13.1/Qt5/5.15.10-GCCcore-13.1.0/lib/libQt5Gui.so.5(_ZN21QOpenGLContextPrivate17setCurrentContextEP14QOpenGLContext+0x34)[0x1555539a69f4]
[hpcl001:3162786] [ 3] /cm/shared/uniol/sw/zen4/13.1/Qt5/5.15.10-GCCcore-13.1.0/lib/libQt5Gui.so.5(_ZN14QOpenGLContext11makeCurrentEP8QSurface+0xa9)[0x1555539a7e79]
[hpcl001:3162786] [ 4] /cm/shared/uniol/sw/zen4/13.1/ParaView/5.12.0-foss-2023a-mpi/lib/libvtkGUISupportQt-pv5.12.so.1(+0x353b1)[0x1555553aa3b1]
[hpcl001:3162786] [ 5] /cm/shared/uniol/sw/zen4/13.1/ParaView/5.12.0-foss-2023a-mpi/lib/libvtkCommonCore-pv5.12.so.1(+0x10a8f5b)[0x15554f86df5b]
[hpcl001:3162786] [ 6] /cm/shared/uniol/sw/zen4/13.1/ParaView/5.12.0-foss-2023a-mpi/lib/libvtkRenderingCore-pv5.12.so.1(_ZN11vtkViewport14RemoveViewPropEP7vtkProp+0xb5)[0x15555223f7b5]
[hpcl001:3162786] [ 7] /cm/shared/uniol/sw/zen4/13.1/ParaView/5.12.0-foss-2023a-mpi/lib/libvtkInteractionWidgets-pv5.12.so.1(_ZN17vtkAbstractWidgetD1Ev+0x3b)[0x155552f752db]
[hpcl001:3162786] [ 8] /cm/shared/uniol/sw/zen4/13.1/ParaView/5.12.0-foss-2023a-mpi/lib/libvtkInteractionWidgets-pv5.12.so.1(+0x15dc83)[0x155552fc6c83]
[hpcl001:3162786] [ 9] /cm/shared/uniol/sw/zen4/13.1/ParaView/5.12.0-foss-2023a-mpi/lib/libvtkRemotingViews-pv5.12.so.1(_ZN15vtkPVRenderViewD2Ev+0x367)[0x15554bacb8e7]
[hpcl001:3162786] [10] /cm/shared/uniol/sw/zen4/13.1/ParaView/5.12.0-foss-2023a-mpi/lib/libvtkRemotingViews-pv5.12.so.1(_ZN15vtkPVRenderViewD0Ev+0x9)[0x15554bacb949]
[hpcl001:3162786] [11] /cm/shared/uniol/sw/zen4/13.1/ParaView/5.12.0-foss-2023a-mpi/lib/libvtkCommonCore-pv5.12.so.1(_ZN19vtkSmartPointerBaseD1Ev+0x1e)[0x15554fa7f73e]
[hpcl001:3162786] [12] /cm/shared/uniol/sw/zen4/13.1/ParaView/5.12.0-foss-2023a-mpi/lib/libvtkCommonCore-pv5.12.so.1(_ZN19vtkSmartPointerBaseaSEP13vtkObjectBase+0x2e)[0x15554fa7f7fe]
[hpcl001:3162786] [13] /cm/shared/uniol/sw/zen4/13.1/ParaView/5.12.0-foss-2023a-mpi/lib/libvtkRemotingServerManager-pv5.12.so.1(_ZN10vtkSIProxy16DeleteVTKObjectsEv+0x29)[0x1555535e9d69]
[hpcl001:3162786] [14] /cm/shared/uniol/sw/zen4/13.1/ParaView/5.12.0-foss-2023a-mpi/lib/libvtkRemotingServerManager-pv5.12.so.1(_ZN10vtkSIProxyD1Ev+0x20)[0x1555535eac50]
[hpcl001:3162786] [15] /cm/shared/uniol/sw/zen4/13.1/ParaView/5.12.0-foss-2023a-mpi/lib/libvtkRemotingServerManager-pv5.12.so.1(_ZN10vtkSIProxyD0Ev+0x9)[0x1555535eaf29]
[hpcl001:3162786] [16] /cm/shared/uniol/sw/zen4/13.1/ParaView/5.12.0-foss-2023a-mpi/lib/libvtkRemotingServerManager-pv5.12.so.1(_ZN16vtkPVSessionCoreD2Ev+0x185)[0x1555535cc785]
[hpcl001:3162786] [17] /cm/shared/uniol/sw/zen4/13.1/ParaView/5.12.0-foss-2023a-mpi/lib/libvtkRemotingServerManager-pv5.12.so.1(_ZN16vtkPVSessionCoreD0Ev+0x9)[0x1555535cc969]
[hpcl001:3162786] [18] /cm/shared/uniol/sw/zen4/13.1/ParaView/5.12.0-foss-2023a-mpi/lib/libvtkRemotingServerManager-pv5.12.so.1(_ZN16vtkPVSessionBaseD1Ev+0x38)[0x1555535c8c58]
[hpcl001:3162786] [19] /cm/shared/uniol/sw/zen4/13.1/ParaView/5.12.0-foss-2023a-mpi/lib/libvtkRemotingServerManager-pv5.12.so.1(_ZN12vtkSMSessionD0Ev+0x9)[0x1555536f0c79]
[hpcl001:3162786] [20] /cm/shared/uniol/sw/zen4/13.1/ParaView/5.12.0-foss-2023a-mpi/lib/libvtkCommonCore-pv5.12.so.1(_ZN19vtkSmartPointerBaseD1Ev+0x1e)[0x15554fa7f73e]
[hpcl001:3162786] [21] /cm/shared/uniol/sw/zen4/13.1/ParaView/5.12.0-foss-2023a-mpi/lib/libvtkRemotingCore-pv5.12.so.1(_ZN16vtkProcessModuleD1Ev+0xae)[0x1555525c2c1e]
[hpcl001:3162786] [22] /cm/shared/uniol/sw/zen4/13.1/ParaView/5.12.0-foss-2023a-mpi/lib/libvtkRemotingCore-pv5.12.so.1(_ZN16vtkProcessModuleD0Ev+0x9)[0x1555525c2ca9]
[hpcl001:3162786] [23] /cm/shared/uniol/sw/zen4/13.1/ParaView/5.12.0-foss-2023a-mpi/lib/libvtkCommonCore-pv5.12.so.1(_ZN19vtkSmartPointerBaseD1Ev+0x1e)[0x15554fa7f73e]
[hpcl001:3162786] [24] /lib64/libc.so.6(__cxa_finalize+0xf7)[0x155554fb2717]
[hpcl001:3162786] [25] /cm/shared/uniol/sw/zen4/13.1/ParaView/5.12.0-foss-2023a-mpi/lib/libvtkRemotingCore-pv5.12.so.1(+0x3c6f7)[0x1555525b76f7]
[hpcl001:3162786] *** End of error message ***
Segmentation fault (core dumped)
[axje9953@hpcl001 p_6]$ module avail paraview

Thanks for showing the logs. Your cluster version of ParaView is not too old.
Without having the linked files, I get the following error messages (local ParaView 5.12.0) before the crash:

Error parsing file.  Unknown tag type:
(   6.956s) [paraview        ]             loguru.cpp:479      1| .   atexit
[sherin-pc:10063] *** Process received signal ***
[sherin-pc:10063] Signal: Segmentation fault (11)
[sherin-pc:10063] Signal code: Address not mapped (1)
[sherin-pc:10063] Failing at address: 0x98c

A segfault is never an accepted outcome, even for incomplete file, please open an issue @Pranay_006

1 Like

I will open an issue.
Can I know why the paraview installed on cluster is failing whereas if I try to open the file with local, though it keeps on displaying the message of not responding… eventually I get the simulation displayed albeit being very slow. Is it because of mpi distribution in the cluster?

Can I avert this issue with the new version of 5.13?

I was suggesting to open the issue about ParaView crashing locally but thats ok.

Please share your whole data then.

As I said the complete data is large, so I have few timestamps shared with you. This problem is in the cluster version(mpi) and not in the local (serial - RC1) version.

Link: sample.zip

Let me know if you need anything else.

Thanks
Pranay

Did a quick check in my local paraview.

  • Crashes with the same error when I used the Nek5000 Reader.
  • Opens without issues when I used the VisIt Nek5000 Reader.

Which reader are you using in the cluster?

I am using the VisIt Nek5000 reader. As I said I could open this using a local paraview via remote folders to the cluster whereas if I open it directly using the paraview in the cluster I get the message as discussed above. Nek5000 reader never worked and resulted in a crash always

The file you provided cannot be opened locally, please share a complete data, for example using wetransfer.com

Hello @mwestphal,

Can I know the reason why you cannot open the file locally? I have managed to open it and then @sherin.sugathan managed to open it as well. Can you be more specific?

Can I know what is needed exactly to sort this issue?

  • ParaView local (5.12.0 prebuilt) opens the file successfully.
  • ParaView local (5.12.0 custom built from source) fails to read the file.
  • ParaView remote (5.12.0 custom built from source) when accessed directly, fails to read the file.

I may be wrong, but I am just making inferences based on my above observations. I suspect that the cluster version of your ParaView was built from source and without the necessary CMAKE option needed for enabling the VisIt nek reader.

Earlier, you said that you can run ParaView directly in your cluster. Could you please run ParaView directly in your cluster, then goto Edit → Settings and then under the IO tab you will find a list named Reader Selection. Scroll down and tell how many entries of “Nek5000 Files” you see there. 1 or 2?

I see only one entry available in the cluster installation of Paraview 5.12.0