Wrong seqNo is set when reading from eventhubs #488

Closed
k4jiang opened this issue Apr 2, 2020 · 7 comments
Labels
known issue This is a verified bug in the codebase.

k4jiang commented Apr 2, 2020

Bug Report:

The following is the stacktrace:

Job aborted due to stage failure: Task 30 in stage 2348.0 failed 4 times, most recent failure: Lost task 30.3 in stage 2348.0 (TID 4058, 10.139.64.6, executor 0): java.lang.IllegalStateException: In partition 30 of http-access-log, with consumer group $Default, request seqNo 19609525 is less than the received seqNo 19684911. The earliest seqNo is 19684804 and the last seqNo is 20231767
	at org.apache.spark.eventhubs.client.CachedEventHubsReceiver.checkCursor(CachedEventHubsReceiver.scala:189)
	at org.apache.spark.eventhubs.client.CachedEventHubsReceiver.org$apache$spark$eventhubs$client$CachedEventHubsReceiver$$receive(CachedEventHubsReceiver.scala:213)
	at org.apache.spark.eventhubs.client.CachedEventHubsReceiver$.receive(CachedEventHubsReceiver.scala:288)
	at org.apache.spark.eventhubs.rdd.EventHubsRDD.compute(EventHubsRDD.scala:120)
	at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:353)
	at org.apache.spark.rdd.RDD.iterator(RDD.scala:317)
	at org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:60)
	at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:353)
	at org.apache.spark.rdd.RDD.iterator(RDD.scala:317)
	at org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:60)
	at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:353)
	at org.apache.spark.rdd.RDD.iterator(RDD.scala:317)
	at org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:60)
	at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:353)
	at org.apache.spark.rdd.RDD.iterator(RDD.scala:317)
	at org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:60)
	at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:353)
	at org.apache.spark.rdd.RDD.iterator(RDD.scala:317)
	at org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:60)
	at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:353)
	at org.apache.spark.rdd.RDD.iterator(RDD.scala:317)
	at org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:99)
	at org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:55)
	at org.apache.spark.scheduler.Task.doRunTask(Task.scala:140)
	at org.apache.spark.scheduler.Task.run(Task.scala:113)
	at org.apache.spark.executor.Executor$TaskRunner$$anonfun$13.apply(Executor.scala:537)
	at org.apache.spark.util.Utils$.tryWithSafeFinally(Utils.scala:1541)
	at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:543)
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
	at java.lang.Thread.run(Thread.java:748)
  • Expected behavior
    Offset is set correctly
  • Spark version
    2.4.5
  • spark-eventhubs artifactId and version
    com.microsoft.azure:azure-eventhubs-spark_2.11:2.3.14.1
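
For anyone reading the numbers in the exception message: the requested seqNo (19609525) is lower than the earliest seqNo the partition reports (19684804), so the requested event is no longer in the partition. The snippet below is only a rough sketch of that comparison, using the values from the message. It is not the connector's `checkCursor` code, and `PartitionRange` and `classify_request` are hypothetical names made up for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PartitionRange:
    """Hypothetical holder for the two bounds reported in the exception."""
    earliest_seq_no: int  # oldest sequence number still in the partition
    last_seq_no: int      # newest sequence number in the partition


def classify_request(request_seq_no: int, partition: PartitionRange) -> str:
    """Say where a requested sequence number falls relative to the partition's range."""
    if request_seq_no < partition.earliest_seq_no:
        return "expired"            # older than anything the partition still holds
    if request_seq_no > partition.last_seq_no:
        return "not_yet_enqueued"   # newer than anything the partition holds
    return "available"


# Values copied from the exception message for partition 30.
partition_30 = PartitionRange(earliest_seq_no=19684804, last_seq_no=20231767)

print(classify_request(19609525, partition_30))  # prints "expired"
print(19684804 - 19609525)                       # prints 75279, the gap to the earliest seqNo
```

This only shows why the cursor check cannot succeed for that request; it says nothing about why the connector asked for that seqNo in the first place.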

k4jiang (Author) commented Apr 2, 2020

@nyaghma I opened a new ticket as we see the same issue on 2.3.14.1. Do you mind taking a look?

nyaghma self-assigned this Apr 3, 2020

nyaghma (Contributor) commented Apr 3, 2020

@k4jiang Thanks for informing us about the recurrence of this issue. I'll investigate it further.

nyaghma (Contributor) commented Apr 3, 2020

What environment do you use? Are you using Databricks? If yes, is autoscaling on?

k4jiang (Author) commented Apr 3, 2020

Yes, I am using Databricks. No, autoscaling is turned off.

sjkwak self-assigned this Apr 3, 2020
sjkwak added the "known issue" label Apr 3, 2020

sjkwak (Member) commented Apr 3, 2020

Hi @k4jiang, this is a known issue and we're working on a fix. We plan to release a new version with the fix within several days.

k4jiang (Author) commented Apr 14, 2020

@sjkwak Does the latest version (com.microsoft.azure:azure-eventhubs-spark_2.11:2.3.15) include the fix for this?

nyaghma (Contributor) commented Jun 8, 2020

Version 2.3.15 includes the fix.

nyaghma closed this as completed Jun 8, 2020