SPARK: 1.3.2 - ASF JIRA

1–106 of 106View in Issue Navigator

Key	Summary	Assignee	Status
SPARK-5456	Decimal Type comparison issue	Adrian Wang	Resolved
SPARK-6595	DataFrame self joins with MetastoreRelations fail	Michael Armbrust	Resolved
SPARK-6886	Big closure in PySpark will fail during shuffle	Davies Liu	Resolved
SPARK-6967	Internal DateType not handled correctly in caching	Adrian Wang	Resolved
SPARK-7660	Snappy-java buffer-sharing bug leads to data corruption / test failures	Josh Rosen	Resolved
SPARK-8781	Published POMs are no longer effective POMs	Andrew Or	Closed
SPARK-8819	Spark doesn't compile with maven 3.3.x	Andrew Or	Resolved
SPARK-3190	Creation of large graph(> 2.15 B nodes) seems to be broken:possible overflow somewhere	Ankur Dave	Resolved
SPARK-5074	Flaky test: o.a.s.scheduler.DAGSchedulerSuite	Shixiong Zhu	Resolved
SPARK-6781	sqlCtx -> sqlContext in pyspark shell	Davies Liu	Resolved
SPARK-6931	python: struct.pack('!q', value) in write_long(value, stream) in serializers.py require int(but doesn't raise exceptions in common cases)	Bryan Cutler	Resolved
SPARK-6985	Receiver maxRate over 1000 causes a StackOverflowError	David McGuire	Resolved
SPARK-7070	LDA.setBeta calls itself	Xiangrui Meng	Resolved
SPARK-7103	SparkContext.union crashed when some RDDs have no partitioner	Steven She	Resolved
SPARK-7181	External Sorter merge with aggregation go to an infinite loop when we have a total ordering	Qiping Li	Resolved
SPARK-7204	Call sites in UI are not accurate for DataFrame operations	Patrick Wendell	Resolved
SPARK-7417	Flaky test: o.a.s.deploy.SparkSubmitUtilsSuite neglect dependencies	Burak Yavuz	Closed
SPARK-7418	Flaky test: o.a.s.deploy.SparkSubmitUtilsSuite search for artifacts	Burak Yavuz	Closed
SPARK-7563	OutputCommitCoordinator.stop() should only be executed in driver	Josh Rosen	Resolved
SPARK-8309	OpenHashMap doesn't work with more than 12M items	Vyacheslav Baranov	Resolved
SPARK-8606	Exceptions in RDD.getPreferredLocations() and getPartitions() should not be able to crash DAGScheduler	Josh Rosen	Resolved
SPARK-9175	BLAS.gemm fails to update matrix C when alpha==0 and beta!=1	Meihua Wu	Resolved
SPARK-10169	Evaluating AggregateFunction1 (old code path) may return wrong answers when grouping expressions are used as arguments of aggregate functions	Yin Huai	Resolved
SPARK-10381	Infinite loop when OutputCommitCoordination is enabled and OutputCommitter.commitTask throws exception	Josh Rosen	Resolved
SPARK-11302	Multivariate Gaussian Model with Covariance matrix returns incorrect answer in some cases	Sean R. Owen	Resolved
SPARK-11424	Guard against MAPREDUCE-5918 by ensuring RecordReader is only closed once in *HadoopRDD	Josh Rosen	Resolved
SPARK-2018	Big-Endian (IBM Power7) Spark Serialization issue	Tim Ellison	Resolved
SPARK-4315	PySpark pickling of pyspark.sql.Row objects is extremely inefficient	Davies Liu	Resolved
SPARK-5220	keepPushingBlocks in BlockGenerator terminated when an exception occurs, which causes the block pushing thread to terminate and blocks receiver	Hari Shreedharan	Resolved
SPARK-5412	Cannot bind Master to a specific hostname as per the documentation	Sean R. Owen	Closed
SPARK-5529	BlockManager heartbeat expiration does not kill executor	shenh062326	Closed
SPARK-5969	The pyspark.rdd.sortByKey always fills only two partitions when ascending=False.	Milan Straka	Resolved
SPARK-6506	python support yarn cluster mode requires SPARK_HOME to be set	Marcelo Masiero Vanzin	Resolved
SPARK-6766	StreamingListenerBatchSubmitted isn't sent and StreamingListenerBatchStarted.batchInfo.processingStartTime is a wrong value	Shixiong Zhu	Resolved
SPARK-6905	Upgrade Snappy Java to 1.1.1.7 to fix bug that resulted in worse compression	Josh Rosen	Resolved
SPARK-6954	ExecutorAllocationManager can end up requesting a negative number of executors	Sandy Ryza	Resolved
SPARK-6998	Make StreamingKMeans `Serializable`	Shixiong Zhu	Resolved
SPARK-7140	Do not scan all values in Vector.hashCode	Xiangrui Meng	Resolved
SPARK-7155	SparkContext's newAPIHadoopFile does not support comma-separated list of files, but the other API hadoopFile does.	Yong Tang	Resolved
SPARK-7187	Exceptions in SerializationDebugger should not crash user code	Andrew Or	Resolved
SPARK-7196	decimal precision lost when loading DataFrame from JDBC	L. C. Hsieh	Resolved
SPARK-7229	SpecificMutableRow should take integer type as internal representation for DateType	Cheng Hao	Resolved
SPARK-7234	When codegen on DateType defaultPrimitive will throw type mismatch exception	Chen Song	Resolved
SPARK-7278	Inconsistent handling of dates in PySparks Row object	Karl-Johan Wettin	Resolved
SPARK-7330	JDBC RDD could lead to NPE when the date field is null	Adrian Wang	Resolved
SPARK-7345	Spark cannot detect renamed columns using JDBC connector	Oleg Sidorkin	Resolved
SPARK-7436	Cannot implement nor use custom StandaloneRecoveryModeFactory implementations	Jacek Lewandowski	Resolved
SPARK-7552	Close files correctly when iteration is finished in WAL recovery	Saisai Shao	Closed
SPARK-7558	Log test name when starting and finishing each test	Andrew Or	Closed
SPARK-7566	HiveContext.analyzer cannot be overriden	Santiago M. Mola	Resolved
SPARK-7621	Report KafkaReceiver MessageHandler errors so StreamingListeners can take action	Jeremy A. Lucas	Resolved
SPARK-7624	Task scheduler delay is increasing time over time in spark local mode	Davies Liu	Resolved
SPARK-7668	Matrix.map should preserve transpose property	L. C. Hsieh	Resolved
SPARK-7946	DecayFactor wrongly set in StreamingKMeans	Manoj Kumar	Resolved
SPARK-8032	Make NumPy version checking in mllib/__init__.py	Manoj Kumar	Resolved
SPARK-8451	SparkSubmitSuite never checks for process exit code	Andrew Or	Closed
SPARK-8535	PySpark : Can't create DataFrame from Pandas dataframe with no explicit column name	Yuri Saito	Resolved
SPARK-8563	Bug that IndexedRowMatrix.computeSVD() yields the U with wrong numCols	19 Lee	Resolved
SPARK-9236	Left Outer Join with empty JavaPairRDD returns empty RDD	François Garillot	Resolved
SPARK-9254	sbt-launch-lib.bash should use `curl --location` to support HTTP/HTTPS redirection	Cheng Lian	Resolved
SPARK-10353	MLlib BLAS gemm outputs wrong result when beta = 0.0 for transpose transpose matrix multiplication	Burak Yavuz	Resolved
SPARK-10642	Crash in rdd.lookup() with "java.lang.Long cannot be cast to java.lang.Integer"	L. C. Hsieh	Resolved
SPARK-10657	Remove legacy SCP-based Jenkins log archiving code	Josh Rosen	Resolved
SPARK-10973	__gettitem__ method throws IndexError exception when we try to access index after the last non-zero entry.	Maciej Szymkiewicz	Resolved
SPARK-10980	Create wrong decimal if unscaled > 1e18 and scale > 0	Davies Liu	Resolved
SPARK-11812	pyspark reduceByKeyAndWindow does not handle unspecified invFunc (invFunc=None)	David Tolpin	Resolved
SPARK-13464	Fix failed test test_reduce_by_key_and_window_with_none_invFunc in pyspark/streaming	L. C. Hsieh	Resolved
SPARK-2168	History Server renered page not suitable for load balancing	Lukasz Jastrzebski	Resolved
SPARK-5634	History server shows misleading message when there are no incomplete apps	Marcelo Masiero Vanzin	Closed
SPARK-5783	Include filename, line number in eventlog-parsing error message	Ryan Williams	Resolved
SPARK-6205	UISeleniumSuite fails for Hadoop 2.x test with NoClassDefFoundError	Sean R. Owen	Resolved
SPARK-6343	Make doc more explicit regarding network connectivity requirements	Peter Parente	Resolved
SPARK-6636	Use public DNS hostname everywhere in spark_ec2.py	Matt Aasted	Resolved
SPARK-6753	Unit test for SPARK-3426 (in ShuffleSuite) doesn't correctly clone the SparkConf	Kay Ousterhout	Resolved
SPARK-6860	Fix the possible inconsistency of StreamingPage	Shixiong Zhu	Resolved
SPARK-6868	Container link broken on Spark UI Executors page when YARN is set to HTTPS_ONLY	Dean Chen	Resolved
SPARK-6878	Sum on empty RDD fails with exception	Erik van Oosten	Resolved
SPARK-6952	spark-daemon.sh PID reuse check fails on long classpath	Punya Biswal	Resolved
SPARK-6975	Argument checking conflict in Yarn when dynamic allocation is enabled	Saisai Shao	Closed
SPARK-6988	Fix Spark SQL documentation for 1.3.x	Olivier Girardot	Resolved
SPARK-6992	Spark SQL documentation for programmatically adding a Schema is broken for Java API	Olivier Girardot	Resolved
SPARK-7036	ALS.train should support DataFrames in PySpark	Xiangrui Meng	Resolved
SPARK-7039	JdbcRdd doesn't support java.sql.Types.NVARCHAR	Shuai Zheng	Resolved
SPARK-7084	Improve the saveAsTable documentation	madhukara phatak	Resolved
SPARK-7323	Use insertAll instead of individual insert while merging combiners	Mridul Muralidharan	Resolved
SPARK-7522	ML Examples option for dataFormat should not be enclosed in angle brackets	Bryan Cutler	Resolved
SPARK-7651	PySpark GMM predict, predictSoft should fail on bad input	Meethu Mathew	Resolved
SPARK-7744	"Distributed matrix" section in MLlib "Data Types" documentation should be reordered.	Mike Dusenberry	Resolved
SPARK-8098	Show correct length of bytes on log page	Carson Wang	Resolved
SPARK-8126	Use temp directory under build dir for unit tests	Marcelo Masiero Vanzin	Resolved
SPARK-8400	ml.ALS doesn't handle -1 block size	Bryan Cutler	Resolved
SPARK-8525	Bug in Streaming k-means documentation	Oleksiy Dyagilev	Resolved
SPARK-8541	sumApprox and meanApprox doctests are incorrect	Scott Taylor	Resolved
SPARK-8865	Fix bug: init SimpleConsumerConfig with kafka params	guowei	Resolved
SPARK-9198	Typo in PySpark SparseVector docs (bad index)	Joseph K. Bradley	Resolved
SPARK-9507	Remove dependency reduced POM hack now that shade plugin is updated	Sean R. Owen	Resolved
SPARK-9607	Incorrect zinc check in build/mvn	Ryan Williams	Resolved
SPARK-9608	Incorrect zinc -status check in build/mvn	Ryan Williams	Resolved
SPARK-9633	SBT download locations outdated; need an update	Sean R. Owen	Resolved
SPARK-9801	Spark streaming deletes the temp file and backup files without checking if they exist or not	Hao Zhu	Resolved
SPARK-10354	First cost RDD shouldn't be cached in k-means\|\| and the following cost RDD should use MEMORY_AND_DISK	Xiangrui Meng	Resolved
SPARK-10556	SBT build explicitly sets Scala version, which can conflict with SBT's own scala version	Ahir Reddy	Resolved
SPARK-11813	Avoid serialization of vocab in Word2Vec	yuhao yang	Resolved
SPARK-12363	PowerIterationClustering test case failed if we deprecated KMeans.setRuns	L. C. Hsieh	Resolved
SPARK-6767	Documentation error in Spark SQL Readme file	Tijo Thomas	Resolved
SPARK-7883	Fixing broken trainImplicit example in MLlib Collaborative Filtering documentation.	Mike Dusenberry	Resolved

1–106 of 106