Thread Blocked - 2000ms limit

camhammel · April 26, 2020, 7:06pm

Hi everyone,

Although my network is running fairly well, my console is littered with the following message every few seconds:

Apr 26, 2020 3:03:20 PM io.vertx.core.impl.BlockedThreadChecker
WARNING: Thread Thread[vert.x-eventloop-thread-0,5,main]=Thread[vert.x-eventloop-thread-0,5,main] has been blocked for 2301 ms, time limit is 2000 ms

I feel like resolving this could speed up the runtime of the training process. Each message has the same content, with the exception of the time blocked and the current date/time.

I appreciate any help,
Thanks!

raver119 · April 26, 2020, 7:17pm

Can you show your source code? DL4J itself doesn’t use vertx

camhammel · April 26, 2020, 7:21pm

Sure, here’s the class I’m running:

public class App
{
UIServer uiServer;
File trainData, testData;

public App() throws IOException
{
    //configure training data and normalizer
    trainData = new File("D:\\cam29\\Downloads\\100-bird-species\\180\\train");
    FileSplit trainSplit = new FileSplit(trainData, NativeImageLoader.ALLOWED_FORMATS, new Random());
    ParentPathLabelGenerator labelMaker = new ParentPathLabelGenerator();
    ImageRecordReader rr = new ImageRecordReader(224,224,3, labelMaker);
    rr.initialize(trainSplit);
    DataSetIterator trainIterator = new RecordReaderDataSetIterator(rr,64,1,180);
    DataNormalization imageScaler = new ImagePreProcessingScaler();
    imageScaler.fit(trainIterator);
    trainIterator.setPreProcessor(imageScaler);

    //configure testing data
    testData = new File("D:\\cam29\\Downloads\\100-bird-species\\180\\test");
    FileSplit testSplit = new FileSplit(testData, NativeImageLoader.ALLOWED_FORMATS, new Random());
    ImageRecordReader rrTest = new ImageRecordReader(224,224,3, labelMaker);
    rrTest.initialize(testSplit);
    DataSetIterator testIterator = new RecordReaderDataSetIterator(rrTest, 64, 1, 180);
    testIterator.setPreProcessor(imageScaler);


    ConvolutionLayer layer0 = new ConvolutionLayer.Builder(5,5)
            .nIn(3)
            .nOut(16)
            .stride(1,1)
            .padding(2,2)
            .weightInit(WeightInit.XAVIER)
            .name("First convolution layer")
            .activation(Activation.RELU)
            .build();

    SubsamplingLayer layer1 = new SubsamplingLayer.Builder(SubsamplingLayer.PoolingType.MAX)
            .kernelSize(2,2)
            .stride(2,2)
            .name("First subsampling layer")
            .build();

    ConvolutionLayer layer2 = new ConvolutionLayer.Builder(5,5)
            .nOut(20)
            .stride(1,1)
            .padding(2,2)
            .weightInit(WeightInit.XAVIER)
            .name("Second convolution layer")
            .activation(Activation.RELU)
            .build();

    SubsamplingLayer layer3 = new SubsamplingLayer.Builder(SubsamplingLayer.PoolingType.MAX)
            .kernelSize(2,2)
            .stride(2,2)
            .name("Second subsampling layer")
            .build();

    ConvolutionLayer layer4 = new ConvolutionLayer.Builder(5,5)
            .nOut(20)
            .stride(1,1)
            .padding(2,2)
            .weightInit(WeightInit.XAVIER)
            .name("Third convolution layer")
            .activation(Activation.RELU)
            .build();

    SubsamplingLayer layer5 = new SubsamplingLayer.Builder(SubsamplingLayer.PoolingType.MAX)
            .kernelSize(2,2)
            .stride(2,2)
            .name("Third subsampling layer")
            .build();

    OutputLayer layer6 = new OutputLayer.Builder(LossFunctions.LossFunction.NEGATIVELOGLIKELIHOOD)
            .activation(Activation.SOFTMAX)
            .weightInit(WeightInit.XAVIER)
            .name("Output")
            .nOut(180)
            .build();

    MultiLayerConfiguration configuration = new NeuralNetConfiguration.Builder()
            .seed(System.currentTimeMillis())
            .optimizationAlgo(OptimizationAlgorithm.STOCHASTIC_GRADIENT_DESCENT)
            //.regularization(true)
            //.l1(0.0001)
            //.l2(0.0002)
            .l2(0.0004)
            //.dropOut(0.8)
            .updater(new Nesterovs(0.001, 0.9))
            .list()
            .layer(0, layer0)
            .layer(1, layer1)
            .layer(2, layer2)
            .layer(3, layer3)
            .layer(4, layer4)
            .layer(5, layer5)
            .layer(6, layer6)
            //.pretrain(false)
            //.backprop(true)
            .setInputType(InputType.convolutional(224,224,3))
            .build();

    MultiLayerNetwork network = new MultiLayerNetwork(configuration);
    network.init();

    network.setListeners(new ScoreIterationListener(10));
    attachUI(network);

    double start_time = System.currentTimeMillis();
    network.fit(trainIterator, 50);
    //network.evaluateROCMultiClass(testIterator);
    Evaluation evaluation = network.evaluate(testIterator);
    double end_time = System.currentTimeMillis();
    if (evaluation.accuracy() > 0.15)
    {
        System.out.println("This run took " + (end_time - start_time)/1000/60  + " minutes.");
        System.out.println(evaluation.stats());
    }
    else
    {
        System.out.println("Accuracy low at " + evaluation.accuracy());
        System.out.println("Repeating...");
        new App();
    }
    uiServer.stop();
}

public static void main(String[] args) {
    try {
        App a = new App();
    } catch (IOException e) {
        e.printStackTrace();
    }
}

public void attachUI(MultiLayerNetwork mln)
{
    uiServer = UIServer.getInstance();

    StatsStorage statsStorage = new FileStatsStorage(new File(System.getProperty("java.io.tmpdir"), "ui-stats.dl4j"));
    uiServer.detach(statsStorage);
    int listenerFrequency = 20;
    uiServer.attach(statsStorage);
    mln.setListeners(new StatsListener(statsStorage, listenerFrequency));
}

}

raver119 · April 26, 2020, 7:33pm

I see. Disable UI then?

camhammel · April 26, 2020, 8:50pm

Disabling the UI worked! It seems odd to me that the UI would be causing that though…

Foodyborris · April 30, 2020, 4:46pm

I believe the UI is using this to provide an HTTP service (I may be wrong). I get the same messages occasionally.

raver119 · April 30, 2020, 4:49pm

Can you file an issue please? Issues · eclipse/deeplearning4j · GitHub

Topic		Replies	Views
Null pointer when running the UIServer DL4J	8	467	August 29, 2021
Can't visualize training process DL4J	24	1517	March 2, 2021
Linux performance issues in highly threaded environment DL4J	1	335	July 29, 2021
Forward pass has very high execution times DL4J	4	212	March 5, 2023
Unable to perform distributed training using tiny ImageNet example DL4J	9	596	September 1, 2021

Thread Blocked - 2000ms limit

Related topics