Use Case
Fetch messages from a Kafka topic and print them, using Spark with Java as the programming language.
I have more than a year of experience with Kafka-Storm integration, developing and maintaining a Kafka cluster and Storm topologies.
I have no experience with Apache Spark or Scala.
A simple word count application was built and tested successfully on a standalone Spark cluster.
Exception in thread "main" java.lang.NoSuchMethodError: scala.Predef$.ArrowAssoc(Ljava/lang/Object;)Ljava/lang/Object;
at org.apache.spark.streaming.kafka.KafkaUtils$.createStream(KafkaUtils.scala:64)
at org.apache.spark.streaming.kafka.KafkaUtils$.createStream(KafkaUtils.scala:110)
at org.apache.spark.streaming.kafka.KafkaUtils.createStream(KafkaUtils.scala)
at com.random.spark.EventsToFileAggregator.main(
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(
at sun.reflect.DelegatingMethodAccessorImpl.invoke(
at java.lang.reflect.Method.invoke(
at org.apache.spark.deploy.SparkSubmit$.org$apache$spark$deploy$SparkSubmit$$runMain(SparkSubmit.scala:731)
at org.apache.spark.deploy.SparkSubmit$.doRunMain$1(SparkSubmit.scala:181)
at org.apache.spark.deploy.SparkSubmit$.submit(SparkSubmit.scala:206)
at org.apache.spark.deploy.SparkSubmit$.main(SparkSubmit.scala:121)
at org.apache.spark.deploy.SparkSubmit.main(SparkSubmit.scala)
JavaPairReceiverInputDStream<String, String> messages =
KafkaUtils.createStream(jsc, args[0], args[1], topicMap,
Successful without any warnings
./bin/spark-submit --class com.random.spark.EventsToFileAggregator --master spark://host:7077 /usr/local/spark/stats/target/stats-1.0-SNAPSHOT-jar-with-dependencies.jar localhost:2181 test topic 2
A NoSuchMethodError is almost always a sign that two libraries are at incompatible versions. In this case Spark Streaming Kafka is calling a Scala library method (scala.Predef$.ArrowAssoc with this signature) that does not exist in the Scala version on the classpath. Check that your Spark Streaming Kafka artifact was built for the Scala version you are running with, and confirm which Scala version is actually in use at run time.
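One way to see which jar actually supplies a class at run time is to ask the class loader. The following is a minimal sketch using only plain Java reflection; `ClassOrigin` and `locate` are hypothetical names introduced here and are not part of Spark or Kafka. It assumes you run it with the same classpath as the failing application.

```java
import java.security.CodeSource;
import java.util.Optional;

final class ClassOrigin {
    private ClassOrigin() {}

    /**
     * Returns where the named class was loaded from, or an empty Optional
     * if the class cannot be found or linked on the current classpath.
     */
    static Optional<String> locate(String className) {
        try {
            // initialize=false: we only want to find the class, not run its static init.
            Class<?> cls = Class.forName(className, false, ClassOrigin.class.getClassLoader());
            CodeSource source = cls.getProtectionDomain().getCodeSource();
            if (source == null || source.getLocation() == null) {
                // Classes from the JDK's boot loader have no code source.
                return Optional.of("<JDK runtime>");
            }
            return Optional.of(source.getLocation().toString());
        } catch (ClassNotFoundException | LinkageError e) {
            return Optional.empty();
        }
    }

    public static void main(String[] args) {
        // Class names taken from the stack trace above.
        String[] names = {"scala.Predef$", "org.apache.spark.streaming.kafka.KafkaUtils$"};
        for (String name : names) {
            System.out.println(name + " -> " + locate(name).orElse("not on classpath"));
        }
    }
}
```

If the printed location for scala.Predef$ is a scala-library jar of a different Scala line than the one the Kafka integration artifact was built for, that would be consistent with the error above.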
I have the following maven versions in my pom.xml (among others):
Camel spring-boot version = 3.7.0 and I want to connect to an SMB endpoint like this:
I read the Camel 3 Migration Guide and found nothing about camel-extras.
When trying to connect, I get an error suggesting that the password option is no longer supported:
Caused by: org.apache.camel.ResolveEndpointFailedException: Failed to resolve endpoint: smb:// due to: There are 1 parameters that couldn't be set on the endpoint. Check the uri if the parameters are spelt correctly and that they are properties of the endpoint. Unknown parameters=[{password=ThePassWorD}]
The documentation link that Google keeps returning seems to be dead.
On Maven Central there is no 3.x version of the camel-jcifs lib, so I am wondering whether the lib is still compatible with Camel 3.x.x. If not, is there an alternative for Camel 3?
I also tried to downgrade the camel-jcifs to 2.24.3 with the same error.
Camel-extras is a separate project from Apache Camel. There is some work under way in the camel-extra repository to support Camel 3 [1], but it is not yet complete and there is no release in sight.
There is now a pull request to add camel-jcifs to the 3.x version:
You could also get my fork and build it yourself:
It got merged and is in the official repository:
To use it with quarkus, you have to convert some List types to arrays.
I'm trying to run an Apache Beam application on a Flink cluster, but it fails with an error translating the Kafka UnboundedSource, saying that [partitions type:ARRAY pos:0] is not serializable. The application is a word count example reading from a Kafka topic and publishing to a Kafka topic, and it works fine using Beam's direct runner.
I created a pom.xml by following Beam's QuickStart Java and then added the KafkaIO sdk. I'm running a single-node local Flink 1.8.1 cluster and Kafka 2.3.0.
pom.xml snippets
<!-- Makes the FlinkRunner available when running a pipeline. -->
<!-- Please see the Flink Runner page for an up-to-date list
of supported Flink versions and their artifact names: -->
<!-- Tried with and without this flink-avro dependency -->
</dependency> snippet
// Create the Pipeline object with the options we defined above.
Pipeline p = Pipeline.create(options);
PCollection<KV<String, Long>> counts = p.apply(KafkaIO.<String, String>read()
.updateConsumerProperties(ImmutableMap.of("auto.offset.reset", (Object)"latest"))
.withoutMetadata() // PCollection<KV<String, String>> instead of KafkaRecord type
The full error message, which is the result of submitting the Beam jar to Flink via /opt/flink/bin/flink run -c org.apache.beam.examples.KafkaWordCount target/word-count-beam-bundled-0.1.jar --runner=FlinkRunner --bootstrapServer=localhost:9092
The program finished with the following exception:
org.apache.flink.client.program.ProgramInvocationException: The main method caused an error: Error while translating UnboundedSource:
at org.apache.flink.client.program.PackagedProgram.callMainMethod(
at org.apache.flink.client.program.PackagedProgram.invokeInteractiveModeForExecution(
at org.apache.flink.client.cli.CliFrontend.executeProgram(
at org.apache.flink.client.cli.CliFrontend.runProgram(
at org.apache.flink.client.cli.CliFrontend.parseParameters(
at org.apache.flink.client.cli.CliFrontend.lambda$main$11(
at org.apache.flink.client.cli.CliFrontend.main(
Caused by: java.lang.RuntimeException: Error while translating UnboundedSource:
at org.apache.beam.runners.flink.FlinkStreamingTransformTranslators$UnboundedReadSourceTranslator.translateNode(
at org.apache.beam.runners.flink.FlinkStreamingTransformTranslators$ReadSourceTranslator.translateNode(
at org.apache.beam.runners.flink.FlinkStreamingPipelineTranslator.applyStreamingTransform(
at org.apache.beam.runners.flink.FlinkStreamingPipelineTranslator.visitPrimitiveTransform(
at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(
at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(
at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(
at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(
at org.apache.beam.sdk.runners.TransformHierarchy$Node.access$600(
at org.apache.beam.sdk.runners.TransformHierarchy.visit(
at org.apache.beam.sdk.Pipeline.traverseTopologically(
at org.apache.beam.runners.flink.FlinkPipelineTranslator.translate(
at org.apache.beam.runners.flink.FlinkStreamingPipelineTranslator.translate(
at org.apache.beam.runners.flink.FlinkPipelineExecutionEnvironment.translate(
at org.apache.beam.examples.KafkaWordCount.runWordCount(
at org.apache.beam.examples.KafkaWordCount.main(
at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(
at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(
at java.base/java.lang.reflect.Method.invoke(
at org.apache.flink.client.program.PackagedProgram.callMainMethod(
... 9 more
Caused by: org.apache.flink.api.common.InvalidProgramException: [partitions type:ARRAY pos:0] is not serializable. The object probably contains or references non serializable fields.
at org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.clean(
at org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.addSource(
at org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.addSource(
at org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.addSource(
at org.apache.beam.runners.flink.FlinkStreamingTransformTranslators$UnboundedReadSourceTranslator.translateNode(
... 32 more
Caused by: org.apache.avro.Schema$Field
at java.base/
at java.base/
at java.base/java.util.ArrayList.writeObject(
at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(
at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(
at java.base/java.lang.reflect.Method.invoke(
at java.base/
at java.base/
at java.base/
at java.base/
at java.base/
at org.apache.flink.util.InstantiationUtil.serializeObject(
... 42 more
It turns out there is a Beam issue about running on Flink that seems to be related to this: One of its comments specifically mentions that using flink/run with KafkaIO doesn't work because Avro's Schema.Field is not serializable:
As mentioned in the comments, a workaround is to downgrade Flink to 1.8.0.
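The innermost frames of the trace (ArrayList.writeObject followed by a failure on Schema$Field) are what plain Java serialization produces when a serializable object holds a list containing a non-serializable element. The sketch below reproduces that pattern with stand-in types only; `SerializationProbe`, `PlainField` and `SourceLike` are hypothetical names and do not model Beam's or Avro's real classes.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.NotSerializableException;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;

final class SerializationProbe {
    private SerializationProbe() {}

    /** Stand-in for a type that does not implement Serializable. */
    static final class PlainField {
        final String name;
        PlainField(String name) { this.name = name; }
    }

    /** Stand-in for a serializable source object that keeps such fields in a list. */
    static final class SourceLike implements Serializable {
        private static final long serialVersionUID = 1L;
        final List<Object> fields = new ArrayList<>();
    }

    /**
     * Tries to Java-serialize the value. Returns null on success, otherwise
     * the message of the NotSerializableException (the offending class name).
     */
    static String firstNonSerializable(Object value) {
        try (ObjectOutputStream out = new ObjectOutputStream(new ByteArrayOutputStream())) {
            out.writeObject(value);
            return null;
        } catch (NotSerializableException e) {
            return e.getMessage();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        SourceLike source = new SourceLike();
        source.fields.add(new PlainField("partitions"));
        System.out.println("offending class: " + firstNonSerializable(source));
    }
}
```

A probe like this can be pointed at any object you plan to hand to a runner, to find the offending class before submitting the job.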
I'm using the Keycloak admin client (version 4.5.0.Final) and am trying to do some simple queries such as looking up a user. The client code is running in a plugin module in another Java server, not standalone. The code looks like this:
try {
Keycloak kc = Keycloak.getInstance(URL, REALM, USER, PWD, CLIENT_ID);
UserRepresentation kcuser = kc.realm(REALM).users().get(USER).toRepresentation();
trace(String.format("Got user: %s", kcuser.toString()));
} catch (Exception e) {
trace("Error authenticating: " + e);
It creates the kc instance successfully, but barfs when trying to look up the user.
This is the error: RESTEASY003215: could not find writer for content-type application/x-www-form-urlencoded type:$1
at org.jboss.resteasy.core.interception.ClientWriterInterceptorContext.throwWriterNotFoundException(
at org.jboss.resteasy.core.interception.AbstractWriterInterceptorContext.getWriter(
at org.jboss.resteasy.core.interception.AbstractWriterInterceptorContext.proceed(
at org.jboss.resteasy.client.jaxrs.internal.ClientInvocation.writeRequestBody(
at org.jboss.resteasy.client.jaxrs.engines.ApacheHttpClient4Engine.writeRequestBodyToOutputStream(
at org.jboss.resteasy.client.jaxrs.engines.ApacheHttpClient4Engine.buildEntity(
at org.jboss.resteasy.client.jaxrs.engines.ApacheHttpClient4Engine.loadHttpMethod(
at org.jboss.resteasy.client.jaxrs.engines.ApacheHttpClient4Engine.invoke(
at org.jboss.resteasy.client.jaxrs.internal.ClientInvocation.invoke(
at org.jboss.resteasy.client.jaxrs.internal.proxy.ClientInvoker.invokeSync(
at org.jboss.resteasy.client.jaxrs.internal.proxy.ClientInvoker.invoke(
at org.jboss.resteasy.client.jaxrs.internal.proxy.ClientProxy.invoke(
at com.sun.proxy.$Proxy362.grantToken(Unknown Source)
at org.keycloak.admin.client.token.TokenManager.grantToken(
at org.keycloak.admin.client.token.TokenManager.getAccessToken(
at org.keycloak.admin.client.token.TokenManager.getAccessTokenString(
at org.keycloak.admin.client.resource.BearerAuthFilter.filter(
at org.jboss.resteasy.client.jaxrs.internal.ClientInvocation.filterRequest(
at org.jboss.resteasy.client.jaxrs.internal.ClientInvocation.invoke(
at org.jboss.resteasy.client.jaxrs.internal.proxy.ClientInvoker.invokeSync(
at org.jboss.resteasy.client.jaxrs.internal.proxy.ClientInvoker.invoke(
at org.jboss.resteasy.client.jaxrs.internal.proxy.ClientProxy.invoke(
at com.sun.proxy.$Proxy372.toRepresentation(Unknown Source)
My pom has the latest dependencies and classpath seems ok, any ideas why this is not working?
I noticed that when a new Keycloak instance is created, RESTEasy checks here for the available providers with the help of the current thread. This is in version 3.9.1.Final, which is the version used by the latest keycloak-admin-client so far (version 11.0.0).
In my specific case we use keycloak-admin-client in combination with graphql-java and CompletableFuture.supplyAsync for our data loaders. This means that in some cases, without further configuration, the current thread is not a plain Thread but a ForkJoinWorkerThread, which apparently breaks the retrieval of the providers.
I am still a beginner in Java, so I would be glad if someone could explain why the registerProviders method does not work with a ForkJoinWorkerThread.
What I learned on DZone is that the JVM sizes the commonPool to two threads when more than 2 CPUs are available. So I tried it and noticed that my app works with 2 CPUs, but fails with the same error (RESTEASY003215) with 3 CPUs.
My current "workaround" is to use CompletableFuture.completedStage when loading data using the keycloak-admin-client.
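If the thread that runs the task is indeed what matters, another option might be to give supplyAsync an explicit executor instead of letting it use the common pool. This is a minimal sketch under that assumption, which the post does not confirm. It also assumes that the provider lookup depends on the thread's context class loader. `ContextAwareAsync`, `newExecutor` and `supply` are hypothetical names, and the pool size and thread name are arbitrary.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.function.Supplier;

final class ContextAwareAsync {
    private ContextAwareAsync() {}

    /** Builds a small pool of ordinary threads that use the given context class loader. */
    static ExecutorService newExecutor(ClassLoader contextLoader) {
        return Executors.newFixedThreadPool(2, runnable -> {
            Thread thread = new Thread(runnable, "keycloak-loader");
            thread.setContextClassLoader(contextLoader);
            thread.setDaemon(true); // do not keep the JVM alive for this pool
            return thread;
        });
    }

    /** Runs the task on the supplied executor rather than ForkJoinPool.commonPool(). */
    static <T> CompletableFuture<T> supply(Supplier<T> task, ExecutorService executor) {
        return CompletableFuture.supplyAsync(task, executor);
    }

    public static void main(String[] args) {
        // Capture the loader of the thread that can see the providers.
        ExecutorService executor = newExecutor(Thread.currentThread().getContextClassLoader());
        try {
            String threadName = supply(() -> Thread.currentThread().getName(), executor).join();
            System.out.println("task ran on: " + threadName);
        } finally {
            executor.shutdown();
        }
    }
}
```

In a data loader, the supplier passed to `supply` would wrap the keycloak-admin-client call. Whether this removes RESTEASY003215 in a given setup would need to be verified.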
I am trying to connect to a Spark master on a remote system from a Java app.
I am using
<dependency> <!-- Spark dependency -->
and code
SparkSession sparkSession = SparkSession.builder().
.appName("spark session example")
JavaSparkContext sc = new JavaSparkContext(sparkSession.sparkContext());
Exception in thread "main" java.lang.NoSuchMethodError: scala.Predef$.ArrowAssoc(Ljava/lang/Object;)Ljava/lang/Object;
at org.apache.spark.sql.SparkSession$Builder.config(SparkSession.scala:713)
at org.apache.spark.sql.SparkSession$Builder.master(SparkSession.scala:766)
at com.mobelisk.spark.JavaSparkPi.main(
Also, if I change the dependency to
<dependency> <!-- Spark dependency -->
the same program fails with
Caused by: java.lang.RuntimeException: org.apache.spark.rpc.netty.RequestMessage; local class incompatible: stream classdesc serialVersionUID = -2221986757032131007, local class serialVersionUID = -5447855329526097695
In spark-shell on the remote machine:
Spark context available as 'sc' (master = local[*], app id = local-1477561433881).
Spark session available as 'spark'.
Welcome to
      ____              __
     / __/__  ___ _____/ /__
    _\ \/ _ \/ _ `/ __/  '_/
   /___/ .__/\_,_/_/ /_/\_\   version 2.0.1
      /_/
Using Scala version 2.11.8 (Java HotSpot(TM) 64-Bit Server VM, Java 1.8.0_101)
Type in expressions to have them evaluated.
Type :help for more information.
As I am very new to all this, I am not able to figure out what is wrong with the program.
I figured it out, and I am posting this in case someone else follows a similar approach.
I had added
which comes with scala-library 2.10.6
but there already exists a scala-library 2.11.8 in spark-core
so I had to exclude the earlier one like this
Now everything is working fine
This is a Scala version mismatch:
your project uses 2.10,
the cluster uses 2.11.
Update the dependency to 2.11.
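Spark artifacts carry the Scala binary version as a suffix of the artifactId (for example `_2.10` or `_2.11`), so a mismatch can often be spotted just by comparing those suffixes. The following is a small sketch of such a check. `ScalaSuffixCheck` is a hypothetical helper, and the artifact list in `main` is an illustrative assumption rather than the poster's actual pom.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Optional;
import java.util.Set;
import java.util.TreeSet;

final class ScalaSuffixCheck {
    private ScalaSuffixCheck() {}

    /** Extracts a trailing "_<major>.<minor>" Scala suffix from an artifactId, if present. */
    static Optional<String> scalaBinaryVersion(String artifactId) {
        int underscore = artifactId.lastIndexOf('_');
        if (underscore < 0) {
            return Optional.empty();
        }
        String suffix = artifactId.substring(underscore + 1);
        return suffix.matches("\\d+\\.\\d+") ? Optional.of(suffix) : Optional.empty();
    }

    /** Collects the distinct Scala suffixes; more than one entry suggests a mismatch. */
    static Set<String> distinctScalaVersions(List<String> artifactIds) {
        Set<String> versions = new TreeSet<>();
        for (String id : artifactIds) {
            scalaBinaryVersion(id).ifPresent(versions::add);
        }
        return versions;
    }

    public static void main(String[] args) {
        // Illustrative artifactIds only.
        List<String> artifacts = Arrays.asList("spark-core_2.11", "spark-sql_2.10", "kafka-clients");
        Set<String> versions = distinctScalaVersions(artifacts);
        System.out.println(versions.size() > 1
                ? "Mixed Scala versions: " + versions
                : "Consistent Scala versions: " + versions);
    }
}
```

The artifactIds to feed in can be read from the pom or from the output of `mvn dependency:tree`, which also shows transitive scala-library versions.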
I was referring to a post here:
Connecting to Zookeeper in a Apache Kafka Multi Node cluster
It's mentioned there that from Kafka 0.9 onward, producers and consumers no longer need the zookeeper.connect property, and bootstrap.servers alone is enough to produce/consume data.
My POM.xml looks like this on the consumer side:
I run into the following issue on the consumer side without the zookeeper.connect property. Does anyone have the consumer part working without the zookeeper.connect property?
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(
at sun.reflect.DelegatingMethodAccessorImpl.invoke(
at java.lang.reflect.Method.invoke(
at org.codehaus.mojo.exec.ExecJavaMojo$
Caused by: java.lang.IllegalArgumentException: requirement failed: Missing required property 'zookeeper.connect'
at scala.Predef$.require(Predef.scala:233)
at kafka.utils.VerifiableProperties.getString(VerifiableProperties.scala:177)
at kafka.utils.ZKConfig.<init>(ZkUtils.scala:902)
at kafka.consumer.ConsumerConfig.<init>(ConsumerConfig.scala:101)
at kafka.consumer.ConsumerConfig.<init>(ConsumerConfig.scala:105)
at io.confluent.examples.consumer.ConsumerGroup.<init>(
at io.confluent.examples.consumer.ConsumerGroup.main(
... 6 more
Only the new consumer works without connecting to Zookeeper and that is available in the kafka-clients artifact. You have to add the dependency:
and use the implementation from the org.apache.kafka.clients.consumer package.
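As a rough sketch of what the new consumer's configuration looks like, the properties below use only bootstrap.servers and no zookeeper.connect. `NewConsumerConfig` is a hypothetical helper, and the broker address and group id are placeholder assumptions. The block deliberately stops at building the Properties so that it compiles without the kafka-clients jar.

```java
import java.util.Properties;

final class NewConsumerConfig {
    private NewConsumerConfig() {}

    /** Builds properties for the new consumer; note there is no zookeeper.connect. */
    static Properties consumerProperties(String bootstrapServers, String groupId) {
        Properties props = new Properties();
        props.setProperty("bootstrap.servers", bootstrapServers);
        props.setProperty("group.id", groupId);
        props.setProperty("key.deserializer",
                "org.apache.kafka.common.serialization.StringDeserializer");
        props.setProperty("value.deserializer",
                "org.apache.kafka.common.serialization.StringDeserializer");
        return props;
    }

    public static void main(String[] args) {
        // Placeholder broker address and group id.
        Properties props = consumerProperties("localhost:9092", "example-group");
        // With kafka-clients on the classpath these would be passed to
        // new org.apache.kafka.clients.consumer.KafkaConsumer<String, String>(props).
        props.forEach((key, value) -> System.out.println(key + "=" + value));
    }
}
```

The old kafka.consumer.ConsumerConfig from the stack trace above is the class that insists on zookeeper.connect, so switching packages is what removes the requirement.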