Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-2362] Fix for newFilesOnly logic in file DStream #1077

Closed
wants to merge 1 commit into from
Closed

[SPARK-2362] Fix for newFilesOnly logic in file DStream #1077

wants to merge 1 commit into from

Conversation

gabrielenizzoli
Copy link
Contributor

The newFilesOnly logic should be inverted: the logic should be that if the flag newFilesOnly==true then only start reading files older than current time. As the code is now if newFilesOnly==true then it will start to read files that are older than 0L (that is: every file in the directory).

The newFilesOnly logic should be inverted: if newFilesOnly==true then only start reading files older than current time. As the code is now if newFilesOnly==true then it will start to read files that are older than 0L (that is: every file in the directory).
@AmplabJenkins
Copy link

Can one of the admins verify this patch?

@tdas
Copy link
Contributor

tdas commented Jul 4, 2014

@gabrielenizzoli Can you please add the "[SPARK-2362]" in the title, like other PRs. 2362 is the JIRA for this bug.

@gabrielenizzoli gabrielenizzoli changed the title Fix for newFilesOnly logic in file DStream [SPARK-2362] Fix for newFilesOnly logic in file DStream Jul 4, 2014
@gabrielenizzoli
Copy link
Contributor Author

@tdas done

@tdas
Copy link
Contributor

tdas commented Jul 8, 2014

Jenkins, this is ok to test.

@tdas
Copy link
Contributor

tdas commented Jul 8, 2014

Jenkins, test this please.

@AmplabJenkins
Copy link

Merged build triggered.

@AmplabJenkins
Copy link

Merged build started.

@AmplabJenkins
Copy link

Merged build finished.

@AmplabJenkins
Copy link

Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16422/

@tdas
Copy link
Contributor

tdas commented Jul 8, 2014

Jenkins, test this please.

@AmplabJenkins
Copy link

Merged build triggered.

@AmplabJenkins
Copy link

Merged build started.

@asfgit asfgit closed this in e6f7bfc Jul 8, 2014
asfgit pushed a commit that referenced this pull request Jul 8, 2014
The newFilesOnly logic should be inverted: the logic should be that if the flag newFilesOnly==true then only start reading files older than current time. As the code is now if newFilesOnly==true then it will start to read files that are older than 0L (that is: every file in the directory).

Author: Gabriele Nizzoli <mail@nizzoli.net>

Closes #1077 from gabrielenizzoli/master and squashes the following commits:

4f1d261 [Gabriele Nizzoli] Fix for newFilesOnly logic in file DStream

(cherry picked from commit e6f7bfc)
Signed-off-by: Tathagata Das <tathagata.das1565@gmail.com>
asfgit pushed a commit that referenced this pull request Jul 8, 2014
The newFilesOnly logic should be inverted: the logic should be that if the flag newFilesOnly==true then only start reading files older than current time. As the code is now if newFilesOnly==true then it will start to read files that are older than 0L (that is: every file in the directory).

Author: Gabriele Nizzoli <mail@nizzoli.net>

Closes #1077 from gabrielenizzoli/master and squashes the following commits:

4f1d261 [Gabriele Nizzoli] Fix for newFilesOnly logic in file DStream

(cherry picked from commit e6f7bfc)
Signed-off-by: Tathagata Das <tathagata.das1565@gmail.com>
@AmplabJenkins
Copy link

Merged build finished. All automated tests passed.

@AmplabJenkins
Copy link

All automated tests passed.
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16423/

gzm55 pushed a commit to MediaV/spark that referenced this pull request Jul 18, 2014
The newFilesOnly logic should be inverted: the logic should be that if the flag newFilesOnly==true then only start reading files older than current time. As the code is now if newFilesOnly==true then it will start to read files that are older than 0L (that is: every file in the directory).

Author: Gabriele Nizzoli <mail@nizzoli.net>

Closes apache#1077 from gabrielenizzoli/master and squashes the following commits:

4f1d261 [Gabriele Nizzoli] Fix for newFilesOnly logic in file DStream

(cherry picked from commit e6f7bfc)
Signed-off-by: Tathagata Das <tathagata.das1565@gmail.com>
xiliu82 pushed a commit to xiliu82/spark that referenced this pull request Sep 4, 2014
The newFilesOnly logic should be inverted: the logic should be that if the flag newFilesOnly==true then only start reading files older than current time. As the code is now if newFilesOnly==true then it will start to read files that are older than 0L (that is: every file in the directory).

Author: Gabriele Nizzoli <mail@nizzoli.net>

Closes apache#1077 from gabrielenizzoli/master and squashes the following commits:

4f1d261 [Gabriele Nizzoli] Fix for newFilesOnly logic in file DStream
udaynpusa pushed a commit to mapr/spark that referenced this pull request Jan 30, 2024
* EZAF-1127 Minor fix of container startup time by improvement in configure.sh / fix for Spark 3.3.2

* Improve changePermission function in configure.sh

* [classpathfilter]  Add classpathfilter go module (apache#1078)

* Add classpathfilter go module

* Properly copy classpathfilter into distribution

* Update bin/mapr-classpath.sh

* Change content of dep-blacklist.txt (apache#1079)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants