In Re Modafinil Antitrust Litigation
This text of 837 F.3d 238 (In Re Modafinil Antitrust Litigation) is published on Counsel Stack Legal Research, covering Court of Appeals for the Third Circuit primary law. Counsel Stack provides free access to over 12 million legal documents including statutes, case law, regulations, and constitutions.
Opinions
OPINION
SMITH, Circuit Judge.
“The class action is an ingenious device for economizing on the expense of litigation and enabling small claims to be litigated. The two points are closely related. If every small claim had to be litigated separately, the vindication of small claims would be rare. The fixed costs of litigation make it impossible.” Thorogood v. Sears, Roebuck and Co., 547 F.3d 742, 744 (7th Cir. 2008). But not every group of plaintiffs should be granted class action status, because “[t]he class action is an ‘exception to the usual rule that litigation is conducted by and on behalf of the individual named parties only.’ ” Wal-Mart Stores, Inc. v. Dukes, 564 U.S. 338, 348, 131 S.Ct. 2541, 180 L.Ed.2d 374 (2011) (quoting Califano v. Yamasaki, 442 U.S. 682, 700-01, 99 S.Ct. 2545, 61 L.Ed.2d 176 (1979)).
When thinking of a class action brought under Rule 23(b)(3), we typically think of a large aggregation of individuals (hundreds or even thousands), each with small claims. This case is quite different from that. Here, we are faced with a putative class of twenty-two large and sophisticated corporations, most of which have multi-million dollar claims, who wish to take advantage of the class action device. While we do not foreclose the possibility of class status in this case, or where the putative class is of similar composition, Plaintiffs have not met their burden of showing that the nu-merosity requirement of Rule 23(a)(1) has been satisfied. We now provide a framework for district courts to apply when conducting their numerosity analyses, and we will remand to the District Court to allow such an analysis in this case.
I.
A. Regulatory Framework
The 1984 Drug Price Competition and Patent Term Restoration Act (the “Hatch-Waxman Act”), 98 Stat. 1585, as amended, provides a regulatory framework designed in part to (1) ensure that only rigorously tested drugs are marketed, (2) incentivize drug manufacturers to invest in new re[243]*243search and development, and (3) encourage generic entry into the marketplace. The Hatch-Waxman Act requires a drug manufacturer wishing to market a new brand-name drug to first submit a New Drug Application (“NDA”) to the federal Food and Drug Administration (“FDA”), and then undergo a long, complex, and costly testing process. See 21 U.S.C. § 355(b)(1) (requiring, among other things, “full reports of investigations” into safety and effectiveness; “a full list of the articles used as components”; and a “full description” of how the drug is manufactured, processed, and packed); see also F.T.C. v. Actavis, Inc., — U.S. -, 133 S.Ct. 2223, 2228-29, 186 L.Ed.2d 343 (2013) (describing the statutory framework). If this process is successful, the FDA will grant the drug manufacturer approval to market the brand-name drug. After this approval, a generic manufacturer can obtain similar approval by submitting an Abbreviated New Drug Application (“ANDA”) that “shows that the generic drug has the same active ingredients' as, and is biologically equivalent to, the brand-name drug.” Caraco Pharm. Labs., Ltd. v. Novo Nordisk A/S, — U.S. -, 132 S.Ct. 1670, 1676, 182 L.Ed.2d 678 (2012) (citing 21 U.S.C. §§ 355(j)(2)(A)(ii), (iv)). This way, a generic manufacturer is not required to undergo the same costly approval procedures to develop a drug that has already satisfied the FDA. Actavis, 133 S.Ct. at 2228 (“The Hatch-Waxman process, by allowing the generic to piggy-back on the pioneer’s approval efforts, ‘speed[s] the introduction of low-cost generic drugs to market,’ thereby furthering drug competition.” (quoting Carnco, 132 S.Ct. at 1676)).
The FDA will not give final approval to produce a generic version of a drug that is entitled to non-patent exclusivity under the Hatch-Waxman Act, and it. “cannot authorize a generic drug that would infringe a patent.” Caraco, 132 S.Ct. at 1676. Thus, among other things, an ANDA’s approval will depend on “the scope and duration of the patents covering the brand-name drug.” Id. Brand manufacturers are required to include the patent number and expiration date of the patent that covers the drug or that covers a method of using that drug in their NDAs, which are then published by the FDA in the Orange Book, more formally known as the Approved Drug Products with Therapeutic Equivalence Evaluations. Id. (citing 21 U.S.C. § 355(b)(1) and 21 C.F.R. §§ 314-.53(c)(2)(ii)(P)(3), (3) (2011)). Once a patent has been listed in the Orange Book, the generic manufacturer is free to file an ANDA if it can certify that its .proposed generic drug will not actually violate the brand manufacturer’s patents. Id. Under 21 U.S.C. § 355(j)(2)(A)(vii), there are four ways in which a generic manufacturer can make this certification:
(I) that such patent information has not been filed,
(II) that such patent has expired,-
(III) of the date on which such patent will expire, or
(IV) that such patent is invalid or will not be infringed by the manufacture, use, or sale of the new drug for which the application is submitted.
An ANDA with a paragraph IV certification may only be filed after the expiration of the fourth year of the New Chemical Entity (“NCE”) five-year exclusivity period.1 21 U.S.C. §355(j)(5)(E)(ii). The “ ‘paragraph IW route[ ] automatically counts as patent infringement.” Actavis, 133 S.Ct. at 2228 (citing 35 U.S.C. [244]*244§ 271(e)(2)(A)). As a result, this often “means provoking litigation” instituted by the brand manufacturer. Carnco, 132 S.Ct. at 1677.
'If the brand manufacturer initiates a patent infringement suit, the FDA must withhold approval of the generic for at least 30 months while the parties litigate the validity or infringement of the patent. Actavis, 133 S.Ct. at 2228 (citing 21 U.S.C. § 355(j)(5)(B)(iii)). If the suit has concluded at the end of this 30-month period, then the FDA will follow the outcome of the litigation. Id. However, if the litigation is still proceeding; the FDA may give its approval to the generic drug manufacturer to begin marketing a generic version of the drug. Id. The generic manufacture then has the option to “launch at risk,” meaning that if the ongoing court proceeding ultimately determines that the patent was valid and infringed, the generic firm will be liable for lost profits despite the FDA’s approval. C. Scott Hemphill, Paying for Delay: Pharmaceutical Patent Settlement as a Regulatory Design Problem, 81 N.Y.U. L. Rev. 1553, 1609 (2006).
In order to incentivize a generic drug manufacturer to challenge weak patents, the Hatch-Waxman Act provides that the first generic manufacturer to file a paragraph IV certification will enjoy a 180-day exclusivity period. 21 U.S.C.
Free access — add to your briefcase to read the full text and ask questions with AI
OPINION
SMITH, Circuit Judge.
“The class action is an ingenious device for economizing on the expense of litigation and enabling small claims to be litigated. The two points are closely related. If every small claim had to be litigated separately, the vindication of small claims would be rare. The fixed costs of litigation make it impossible.” Thorogood v. Sears, Roebuck and Co., 547 F.3d 742, 744 (7th Cir. 2008). But not every group of plaintiffs should be granted class action status, because “[t]he class action is an ‘exception to the usual rule that litigation is conducted by and on behalf of the individual named parties only.’ ” Wal-Mart Stores, Inc. v. Dukes, 564 U.S. 338, 348, 131 S.Ct. 2541, 180 L.Ed.2d 374 (2011) (quoting Califano v. Yamasaki, 442 U.S. 682, 700-01, 99 S.Ct. 2545, 61 L.Ed.2d 176 (1979)).
When thinking of a class action brought under Rule 23(b)(3), we typically think of a large aggregation of individuals (hundreds or even thousands), each with small claims. This case is quite different from that. Here, we are faced with a putative class of twenty-two large and sophisticated corporations, most of which have multi-million dollar claims, who wish to take advantage of the class action device. While we do not foreclose the possibility of class status in this case, or where the putative class is of similar composition, Plaintiffs have not met their burden of showing that the nu-merosity requirement of Rule 23(a)(1) has been satisfied. We now provide a framework for district courts to apply when conducting their numerosity analyses, and we will remand to the District Court to allow such an analysis in this case.
I.
A. Regulatory Framework
The 1984 Drug Price Competition and Patent Term Restoration Act (the “Hatch-Waxman Act”), 98 Stat. 1585, as amended, provides a regulatory framework designed in part to (1) ensure that only rigorously tested drugs are marketed, (2) incentivize drug manufacturers to invest in new re[243]*243search and development, and (3) encourage generic entry into the marketplace. The Hatch-Waxman Act requires a drug manufacturer wishing to market a new brand-name drug to first submit a New Drug Application (“NDA”) to the federal Food and Drug Administration (“FDA”), and then undergo a long, complex, and costly testing process. See 21 U.S.C. § 355(b)(1) (requiring, among other things, “full reports of investigations” into safety and effectiveness; “a full list of the articles used as components”; and a “full description” of how the drug is manufactured, processed, and packed); see also F.T.C. v. Actavis, Inc., — U.S. -, 133 S.Ct. 2223, 2228-29, 186 L.Ed.2d 343 (2013) (describing the statutory framework). If this process is successful, the FDA will grant the drug manufacturer approval to market the brand-name drug. After this approval, a generic manufacturer can obtain similar approval by submitting an Abbreviated New Drug Application (“ANDA”) that “shows that the generic drug has the same active ingredients' as, and is biologically equivalent to, the brand-name drug.” Caraco Pharm. Labs., Ltd. v. Novo Nordisk A/S, — U.S. -, 132 S.Ct. 1670, 1676, 182 L.Ed.2d 678 (2012) (citing 21 U.S.C. §§ 355(j)(2)(A)(ii), (iv)). This way, a generic manufacturer is not required to undergo the same costly approval procedures to develop a drug that has already satisfied the FDA. Actavis, 133 S.Ct. at 2228 (“The Hatch-Waxman process, by allowing the generic to piggy-back on the pioneer’s approval efforts, ‘speed[s] the introduction of low-cost generic drugs to market,’ thereby furthering drug competition.” (quoting Carnco, 132 S.Ct. at 1676)).
The FDA will not give final approval to produce a generic version of a drug that is entitled to non-patent exclusivity under the Hatch-Waxman Act, and it. “cannot authorize a generic drug that would infringe a patent.” Caraco, 132 S.Ct. at 1676. Thus, among other things, an ANDA’s approval will depend on “the scope and duration of the patents covering the brand-name drug.” Id. Brand manufacturers are required to include the patent number and expiration date of the patent that covers the drug or that covers a method of using that drug in their NDAs, which are then published by the FDA in the Orange Book, more formally known as the Approved Drug Products with Therapeutic Equivalence Evaluations. Id. (citing 21 U.S.C. § 355(b)(1) and 21 C.F.R. §§ 314-.53(c)(2)(ii)(P)(3), (3) (2011)). Once a patent has been listed in the Orange Book, the generic manufacturer is free to file an ANDA if it can certify that its .proposed generic drug will not actually violate the brand manufacturer’s patents. Id. Under 21 U.S.C. § 355(j)(2)(A)(vii), there are four ways in which a generic manufacturer can make this certification:
(I) that such patent information has not been filed,
(II) that such patent has expired,-
(III) of the date on which such patent will expire, or
(IV) that such patent is invalid or will not be infringed by the manufacture, use, or sale of the new drug for which the application is submitted.
An ANDA with a paragraph IV certification may only be filed after the expiration of the fourth year of the New Chemical Entity (“NCE”) five-year exclusivity period.1 21 U.S.C. §355(j)(5)(E)(ii). The “ ‘paragraph IW route[ ] automatically counts as patent infringement.” Actavis, 133 S.Ct. at 2228 (citing 35 U.S.C. [244]*244§ 271(e)(2)(A)). As a result, this often “means provoking litigation” instituted by the brand manufacturer. Carnco, 132 S.Ct. at 1677.
'If the brand manufacturer initiates a patent infringement suit, the FDA must withhold approval of the generic for at least 30 months while the parties litigate the validity or infringement of the patent. Actavis, 133 S.Ct. at 2228 (citing 21 U.S.C. § 355(j)(5)(B)(iii)). If the suit has concluded at the end of this 30-month period, then the FDA will follow the outcome of the litigation. Id. However, if the litigation is still proceeding; the FDA may give its approval to the generic drug manufacturer to begin marketing a generic version of the drug. Id. The generic manufacture then has the option to “launch at risk,” meaning that if the ongoing court proceeding ultimately determines that the patent was valid and infringed, the generic firm will be liable for lost profits despite the FDA’s approval. C. Scott Hemphill, Paying for Delay: Pharmaceutical Patent Settlement as a Regulatory Design Problem, 81 N.Y.U. L. Rev. 1553, 1609 (2006).
In order to incentivize a generic drug manufacturer to challenge weak patents, the Hatch-Waxman Act provides that the first generic manufacturer to file a paragraph IV certification will enjoy a 180-day exclusivity period. 21 U.S.C. § 355(j)(5)(B)(iv). This means that during this exclusivity period, “no other generic can compete with the brand-name drug,” Actavis, 133 S.Ct. at 2229, an opportunity that can be “ ‘worth several hundred million dollars,’ ” to the first-filer, id. (quoting Hemphill, supra, at 1579).2 It is during this generic exclusivity period- that the “vast majority of potential profits for a generic drug manufacturer materialize.” Id. (internal quotation marks omitted). That is because once the exclusivity period has expired other generic manufacturers are free to enter the market, bringing the price down to competitive levels. Importantly, this 180-day exclusivity period- belongs only to the first generic manufacturer to file; if the first-filer forfeits its exclusivity rights, no othér generic manufacturer is entitled to it. Id. (citing 21 U.S.C. § 355(j)(5)(D)).
B. Facts
In April 1997, the United States Patent and Trademark Office issued U.S. Patent No. 5,618,845 (“the '845 patent”) to Cephal-on, Inc. (“Cephalon”), a pharmaceutical company. The '845 patent claimed a specific particle-size distribution of modafinil, a wakefulness-promoting agent used to treat narcolepsy and other sleep disorders, and Cephalon later applied for a reissue of the patent, resulting in the issuance of U.S. Reissue Patent No. 37,516 (“the '516 patent”) in January 2002. Thus, Cephalon’s use of modafinil was protected by a patent until October 6, 2014, to be later extended until April 6, 2015.
In December 1998, the FDA. approved Cephalon’s NDA for the brand-name drug Provigil and granted it NCE exclusivity. This five-year period of exclusivity was extended until December 24, 2005, due to Cephalon’s status as an orphan drug.3 In [245]*245March 2006, Cephalon obtained pediatric exclusivity, which added an additional six months of exclusivity. 21 U.S.C, §355a(c). Thus, in the, absence of the '516 patent, Cephalon’s exclusivity period for modafinil would have ended on June 24,2006.
On December 24, 2002, the first day that an ANDA for modafinil could be filed, four generic drug manufacturers — Teva Pharmaceutical Industries,,Ltd. and Teva Pharmaceuticals, USA, Inc., (collectively “Teva”); Ranbaxy Laboratories, Ltd. and Ranbaxy Pharmaceuticals, Inc. (collectively “Ranbaxy”); Mylan Pharmaceuticals, Inc. and Mylan Inc. (collectively “Mylan”); and Barr Laboratories, Inc. (“Barr”)— each independently filed an ANDA with paragraph IV certifications seeking to sell generic modafinil products. Due to FDA guidance promulgated after the paragraph IV certifications in this case were filed, all four generic manufacturers were treated as being the first filer, and thus all four would have shared in the 180-day exclusivity period,' making it less valuable to each individual generic manufacturer. See Guidance for Industry on 180-Day Exclusivity when Multiple Abbreviated New Drug Applications are Submitted on the Same Day, 68 Fed. Reg. 45252, 45255 (Aug. 1, 2003).
Because the filing of the paragraph IV certification “automatically counts as patent infringement,” Actavis, 133 S.Ct. at 2228 (citing 35 U.S.C. § 271(e)(2)(A)), Ce-phalon sued the four generic manufacturers for patent infringement in the District of New Jersey on March 28, 2003. While motions for summary judgment were pending, Cephalon entered into what are known as “reverse-payment settlements”4 with each of the four generic manufacturers. First, Cephalon settled with Teva on December 9, 2005. This agreement ended the patent litigation between Cephalon and Teva, and as a result Teva was granted a license to sell modafinil in October 2012, which was before the expiration of Cephal-on’s patent but several years later than Teva could have entered the! market if it had launched its generic “at-risk.” In exchange for its agreement to settle, Teva was paid millions of dollars to stay out of the market viá royalty agreements, supply agreements, and other' contractual provisions. Importantly, the only term of the deal that was publicized was what is known as the “contingent launch provision.” This provision allowed Teva to enter the generic modafinil market if any other company entered the market for any reason.
Almost two weeks later, on December 22, 2005, Ranbaxy entered into a similar reverse-payment settlement agreement with Cephalon on slightly less favorable terms, but also with a contingent launch provision. Again, the contingent launch provision was .publicized via press release. Two weeks after the Ranbaxy settlement, on January 9, 2006, Mylan entered into a similar agreement — on less favorable terms than Ranbaxy — but also with a publicized contingent, launch provision. The final remaining paragraph IV filer, Barr, settled on the least favorable terms on [246]*246February 1, 2006. It too had a contingent launch provision, which was publicized as well. Because no subsequent paragraph IV filer would be entitled to the 180-day exclusivity period, there was no incentive for another generic manufacturer to unilaterally bear the litigation expenses for the reward that it would have to share with any other generic manufacturer who wanted to enter the market. See 21 U.S.C. § 356(j)(5)(D).5
The Direct Purchaser Plaintiff (“DPP”) putative class, appellees in this case, filed suit on April 27, 2006, alleging a global conspiracy involving Cephalon and all four generic defendants under 15 U.S.C. § 1; four separate conspiracies between Ce-phalon and each generic defendant under the same statute; and a monopolization claim against Cephalon under 15 U.S.C. § 2. The DPP class is made up of wholesalers who purchased Provigil directly from Cephalon.6
The District Court, with the full support of the parties, ordered that motions regarding class certification were not to be filed until after fact and expert discovery and the motions for summary judgment had been filed. Thus, the DPP class did not file its motion for class certification until May 12, 2014, after more than eight years of litigation. Approximately one month later, on June 23, 2014, the District Court granted summary judgment in favor of all of the defendants on the DPP class’ global antitrust conspiracy claim. Over the next 13 months, several letter motions and hearings were held on the class certification issue, and the District Court certified the DPP class on July 27, 2015. During this period, Cephalon, Teva, and Barr settled with the DPP class for $512 million on April 17, 2015. A settlement class, which has the exact same composition as the putative DPP class at issue here, was also certified on July 27, 2015, and the settlement itself was approved by the District Court on October 15, 2015. Thus, the only defendants remaining at the time of the DPP certification decision being appealed were Ranbaxy and Mylan (collectively “Defendants”).
II.
Defendants challenge two aspects of the District Court’s class certification decision — numerosity and predominance. Thus, even though other issues were contested at the District Court level, we focus only on the District Court’s numerosity analysis under Rule 23(a)(1) and its predominance analysis under Rule 23(b)(3).
[247]*247Plaintiffs argued before the District Court that the class was comprised of twenty-two members. Defendants challenged the inclusion of four of these members. Thus, the District Court began its numerosity analysis by determining the proper class size because “relevant precedent makes significant distinctions between classes containing more than twenty class members and those containing twenty or fewer.” King Drug Co. of Florence, Inc. v. Cephalon, Inc., 309 F.R.D. 195, 204 (E.D. Pa. 2015). Defendants challenged two class members’ inclusion solely for numerosity purposes because they were partial assignees of two other class members. They argued that counting the partial assignees would essentially allow the DPP class to “double dip” and artificially inflate the class size. Defendants next challenged the inclusion of a class member that ceased operations prior to generic entry actually occurred, arguing that there was no way to know whether it would have actually purchased generic modafinil. Lastly, Defendants challenged the class .status of a class member that only purchased branded Provigil from. Cephalon after generic modafinil had already entered the market. Defendants argued, that there was no overcharge as .a result of this. The District Court rejected all of Defendants’ challenges to the class size. Id. at 204-06.
The District Court next considered whether joinder of these twenty-two class members was impracticable such that class certification was appropriate under Rule 23(a)(1). While the District Court acknowledged' that a class of twenty-two members was small compared to most class actions, the District Court found persuasive several district court cases in the reverse-payment settlement context with similarly-situated classes where the numerosity requirement was found to be satisfied. Id. at 204 (collecting cases)
In analyzing whether joinder was impracticable, the District Court examined five factors: “(1) judicial economy, (2) geographic dispersion, (3) financial resources of class members, (4) the claimant’s ability to institute individual suits, and (5) requests for injunctive relief that could affect future class members.” Id. at 203-04 (quoting In re Wellbutrin XL Antitrust Litig., No. 08-2431, 2011 WL 3563385, at *3 (E.D. Pa. Aug. 11, 2011)). The District Court placed great weight on the judicial economy factor, with particular emphasis on the late stage of the litigation. Specifically, the Court stated: “Considering the extensive history of this litigation and the exhaustive discovery that has been conducted, ... judicial economy is best served by trying this case as a class action. Joinder of the absent class members would likely require additional rounds of discovery, which would only further delay a trial date.” Id. at 206-07.
Relatedly, the District Court also expressed the concern that if the class was not certified at this late date, unnamed class members would bring individual suits in other jurisdictions instead of seeking to be joined in the suit before him.- Id. at 207 (“Further, if cases were brought within other jurisdictions, additional discovery is certainly a possibility, and separate trials could result in inconsistent verdicts.”). The other primary factor that the District Court found to weigh in favor of numerosity was the geographic dispersion of the class members, who were spread out over thirteen states and Puerto Rico. Id.
On the other hand, the Court noted that some factors weighed against class certification. First, the class members’ vast financial resources weighed against certification, as each was a sophisticated corporation. Id. The District Court also looked to their incentive to bring individual claims, stating that the class members’ [248]*248ability to bring individual suits generally weighed against certification, but equivocating somewhat because the six class members with claims below $1 million “likely do not have the same incentive to engage in costly antitrust litigation on their own.” Id. It is not clear what weight was ultimately placed on the parties’ financial incentive to bring suit, and the District Court appeared to treat this as either a neutral factor or one that weighed in favor of Defendants. Ultimately, the District Court held that the requirements of Rule 23(a)(1) were satisfied and the class was sufficiently numerous such that joinder was impracticable. Id.
The District Court next addressed Defendants’ predominance argument that, after the District Court’s grant of summary judgment on the global conspiracy claim, common issues of law and fact did not predominate over individualized inquiries under Rule 23(b)(3). Id. at 209. Defendants argued that the damages model of Plaintiffs’ expert, Dr. Leitzinger, no longer matched Plaintiffs’ theory of liability because it did not isolate the harm caused by each individual reverse-payment settlement. Defendants claimed that this mismatch is analogous to the problem at issue in the Supreme Court’s decision in Comcast Corp. v. Behrend, — U.S. -, 133 S.Ct. 1426, 185 L.Ed.2d 515 (2013). Related to this argument, Defendants relied upon the doctrine of antitrust standing to support the view that, in the absence of a global conspiracy, each class member would have to show which agreement harmed him, and that this would necessarily be an individualized inquiry. The District Court rejected these arguments, concluding that Plaintiffs had antitrust standing and that the doctrine of joint and several liability was appropriate. Thus, it concluded that Comcast was -not controlling because Dr. Leitzinger’s damages model “matches Plaintiffs’ remaining theory of liability and impact.” Id. at 214.
III.
“The class action is an exception to the usual rule that litigation is conducted by and on behalf of the individual named parties -only.” Wal-Mart, 564 U.S. at 348, 181 S.Ct. 2541 (internal quotation marks omitted). In order to justify this exception to the rule, “every putative class action must satisfy the 'four requirements of Rule 23(a)- and the requirements of either Rule 23(b)(1), (2), or (3).” Marcus v. BMW of N.A., LLC, 687 F.3d 583, 590 (2012). In order to satisfy Rule 23(a); a plaintiff must show:
(1) the class must be “so numerous that joinder of all members is impracticable” (numerosity); (2) there must be “questions of law or fact common to the class” (commonality); (3) “the claims or defenses of the representative parties” must be “typical of the claims or defenses of the class” - (typicality); and (4) the named plaintiffs ihust “fairly and adequately protect the interests of the class” (adequacy of representation, or simply adequacy).
In re Cmty. Bank of N. Va., 622 F.3d 275, 291 (3d Cir. 2010) (quoting Fed. R. Civ. P. 23). Rule 23(b)(3), which is the basis for certification here, “requires that (i) Common questions of law or fact predominate (predominance), and (ii) the class action is the superior method for adjudication (superiority).” Marcus, 687 F.3d at 591 (quoting In re Cmty. Bank of N. Va., 622 F.3d at 291). “The party seeking certification bears the burden of establishing each element of Rule 23 by a preponderance of the evidence.” Id.
We have held that “the decision to certify a class calls for findings by the court, not merely a ‘threshold showing’ by a party, that each requirement of Rule 23 [249]*249is met,” and that “[fjactual determinations supporting Rule 23 findings must be made by a preponderance of the.evidence.” In re Hydrogen Peroxide Antitrust Litig., 552 F.3d 305, 307 (3d Cir. 2008). In addition, a court “must resolve all factual or legal disputes relevant to class certification, even if they overlap .with the merits— including disputes touching on elements of the cause of action.” Id Class certification will thus be “proper-only ‘if the trial court, is satisfied, after, a rigorous analysis, that the prerequisites’ of Rule 23 are met.” Id. (quoting Gen. Tel Co. of Sw. v. Falcon, 457 U.S. 147, 161, 102 S.Ct. 2364, 72 L.Ed.2d 740 (1982)); see also Newton v. Merrill Lynch, Pierce, Fenner & Smith, Inc., 259 F.3d 154, 166 (3d Cir. 2001) (“A class certification decision requires a thorough examination of the factual and legal allegations.”).
Thus, while a district court “possesses broad discretion to control proceedings and frame issues for consideration under Rule 23,” such discretion “does not soften the rule [that] a class may not be certified without a finding that each Rule 23 requirement is met.” Hydrogen Peroxide, 552 F.3d at 310. This is particularly true because, acknowledging the practicalities of class litigation, we have said that class certification “is often the defining moment in class actions (for it may sound the ‘death knell’ of the litigation on the part of plaintiffs, or create unwarranted pressure to settle nonmeritorious claims on the part of defendants).” Newton, 259 F.3d at 162.
“We review a class certification order for abuse of discretion, which occurs if the district court’s decision rests upon a clearly erroneous finding of fact, an errant conclusion of law or an improper application of law ;to fact.” Hydrogen Peroxide, 552 F.3d at 312 (internal quotation marks omitted). Although Defendants raise the issue of predominance first, the requirements of Rule 23(a) are “threshold requirements,” Amchem Prods., Inc. v. Windsor, 521 U.S. 591, 613, 117 S.Ct. 2231, 138 L.Ed.2d 689 (1997), and we therefore address them first.
A. Numerosity
Rule 23(a)(1) sets forth what is commonly known as the numerosity requirement. The text is, however, conspicuously devoid of any numerical minimum required for class certification. Instead, the rule, simply states that the numerosity requirement is satisfied when “the class is so numerous that joinder of all members is impracticable.” Fed R. Civ. P. 23(a)(1). “Impracticable does not- mean impossible,” Robidoux v. Celani, 987 F.2d 931, 935 (2d Cir. 1993), and refers rather to the difficulties of achieving joinder. This calls for an inherently fact-based analysis that requires a district court judge to “take into account the context of the particular case,” thereby'providing district courts considerable discretion in making numerosity determinations. Pa. Pub. Sch. Emps. Ret. Sys. v. Morgan Stanley & Co., 772 F.3d 111, 120 (2d Cir. 2014). A district court abuses that discretion, however, when it considers issues that have no place -in the numerosity requirement. Hydrogen Peroxide, 552 F.3d at 312. In this case, the District Court abused its discretion by improperly emphasizing the late stage of the proceeding and by not considering the ability of individual class members to pursue their cases through the use of joinder.7
While “[n]o minimum number of plaintiffs is required to maintain a suit as a class action,” our Court has said that “gen[250]*250erally if the named plaintiff demonstrates that the potential number of plaintiffs exceeds 40, the first prong of Rule 23(a) has been met.” Stewart v. Abraham, 275 F.3d 220, 226-27 (3d Cir. 2001); see also Robidoux, 987 F.2d at 936 (“[T]he difficulty in joining as few as 40 putative class members should raise a presumption that join-der is impracticable.”). At the other end of the spectrum, the Supreme Court has stated in dicta that a class of fifteen was “too small to meet the numerosity requirement.” Gen. Tel. Co. of the Nw, Inc. v. EEOC, 446 U.S. 318, 331, 100 S.Ct. 1698, 64 L.Ed.2d 319 (1980). Leading treatises have collected cases and recognized the general rule that “[a] class of 20 or fewer is usually insufficiently numerous ... [a] class of 41 or more is usually sufficiently numerous.... [while] [classes with between 21 and 40 members are given varying treatment. These midsized classes may or may not meet the numerosity requirement depending on the circumstances of each particular case.” 5 James Wm. Moore, et al., Moore’s Federal Practice § 23.22; see also 5 William B. Rubenstein, Newberg on Class Actions § 3:12 (“As a general guideline ... a class that encompasses fewer than 20 members will likely not be certified absent other indications of impracticability of joinder, while a class of 40 or more members raises a presumption of impracticability of joinder based on numbers alone.” (internal footnotes omitted)); Cox v. Am. Cast Iron Pipe Co., 784 F.2d 1546, 1553 (3d Cir. 1986) (citing Moore favorably).
At this point, we need not specify a “floor” at which a putative class will fail to satisfy the numerosity requirement. Instead, we simply note that the number of class members is the starting point of our numerosity analysis. Although district courts are always under an obligation to ensure that joinder is impracticable, their inquiry into impracticability should be particularly rigorous when the putative class consists of fewer than forty members. Because the District Court certified a class of twenty-two members, which is only slightly above the twenty-member floor suggested by the leading treatises, we first address Defendants’ challenge to the size of the putative class concerning the partial assignment of some claims. After determining that the class is comprised of twenty-two members, we scrutinize the District Court’s numerosity reasoning in this case. Because the District Court erred in its analysis of the two most important factors applicable here, we see no need to examine the other factors and will .remand for the District Court to again engage in a numer-osity inquiry consistent with the reasoning in this opinion.
1. The Size of the Class
The District Court rejected Defendants’ argument that two class members should not be included in the class for numerosity purposes because they were partial assignees of two other class members. If Defendants were correct, the class would be comprised of only twenty class members, not twenty-two. On the other hand, for the first time on appeal, Plaintiffs argue that they have uncovered three more assignees of claims, and that the class consists of twenty-five members.
Defendants appear to have abandoned their partial assignment argument on appeal, arguing in one sentence that “four of the 22 potential class members were improperly included in the class.” Appellant Br. at 48.8 Defendants make no reference [251]*251to case law and rely simply on cursory citations to the record. We could, for good reason, deem these arguments abandoned and waived on appeal. Kost v. Kozakiewicz, 1 F.3d 176, 182 (3d Cir. 1993). However, because we are remanding the numerosity issue to the District Court, we think it appropriate to consider this issue pertaining to the size of the class because the partial assignability issue impacts whether the three additional class members should be included in the class on remand. See Bagot v. Ashcroft, 398 F.3d 262, 256 (3d Cir. 2005) (“This Court has discretionary power to address issues that have been waived”).
Initially, Defendants’ partial assignment argument makes intuitive sense. Why should Plaintiffs be able to take one claim and turn it into two for numerosity purposes? How is this not a form of “double dipping”? Nevertheless, no matter how intuitively appealing this argument may be, it lacks legal support. The text of Rule 23(a)(1) says nothing about the number of claims; instead, it refers to the number of class members. Fed. R. Civ. P. 23(a)(1) (requiring an inquiry into whether “the class is so numerous that joinder of all members is impracticable” (emphasis added)).
Moreover, as the District Court recognized, there is persuasive circuit precedent establishing that partial assignees are appropriately considered to be members of a class. In In re Fine Paper Litigation, 632 F.2d 1081, 1089 (3d Cir. 1980), the state of Washington was the recipient of partial assignments of antitrust claims. It sought to be excluded from the settlement class, and the district court held, among other reasons for denying the right to opt out, that “the state’s assertion of the assigned claims would result in an impermissible fragmentation of the .’.. causes of action.” Id. We reversed and “reject[ed] the defendant’s position that the partial assignments improperly fragment the claim.” Id. at 1090. We looked to section 156 of the Restatement of Contracts for guidance and concluded that “[a]n assignment of a fractional part of a single and entire right against an obligor is operative as if the part had been a separate right.” Id. at 1091; Restatement (First) of Contracts § 156 (“An assignment of either a fractional part of a single and entire right against an obligor ... is operative as to that part or amount to the same extent and in the same mariner as if the part had been a separate right.”).9
At the same time, we held that when the “collective , right to the entire claim” is split, “the partial assignee may not maintain the original suit” unless the obligor has consented in order to protect the “right[ ] of the obligor to be free of successive and repeated suits growing out of the [252]*252same basic facts.” In re Fine Paper Litig., 632 F.2d at 1091. When the obligor does not consent to these separate suits, then these rights are protected by the use of the joinder rules or the class action mechanism. Id-. Thus,- the state of Washington could be made a party that, unlike other class members,- did “not have the right to opt out.” Id. In our case, Defendants are really seeking the opposite of what we said was permissible in Fine Paper Litigation: they want us to say that these two partial assignees must proceed independent of the class.
While Fine Paper Litigation did not address numerosity, we consider its reasoning instructive. Crucially, we held there that a partial assignment “is operative as if the part had been a separate right.” Id. at 1091. Moreover, Fine Paper Litigation envisioned the class action mechanism as a proper tool for partial assignees to participate in, the lawsuit, albeit with fewer individual rights than other claimants. We agree with the District Court that, unless there is evidence that the class plaintiffs are seeking to artificially inflate the number of claimants, partial assignees may properly be treated as class members. On remand, the District Court will need to consider whether the three new assignees that Plaintiffs first mention on appeal should be considered as class members.10 Thus, at this point, we assume that the class consists of twenty-two members. .
2. Impracticability of Joinder
In Marcus, we recognized the three core purposes of the numerosity requirement:
First, it ensures judicial economy. It does so by freeing federal courts from the onerous rule of compulsory joinder inherited from the English Courts of Chancery and the law of equity. Courts no longer have to conduct a single, administratively burdensome action with all interested parties compelled to join and be present. The impracticability of joinder, or numerosity, requirement also promotes judicial economy by sparing courts the burden of having to decide numerous, sufficiently similar individual actions seriatim. As for its second objective, Rule 23(a)(1) creates greater access to judicial relief,' particularly for those persons with claims that would be uneconomical to litigate individually. Finally, the rule prevents putative class representatives and their counsel, when join-der can be easily accomplished, from unnecessarily depriving members of a small class of their right to a day in court to adjudicate their own claims.
687 F.3d at 694-95 (internal citations omitted). However, in Marcus, we had no need to provide a list of factors that should be considered in the numerosity analysis, because it was “[m]ere speculation” that anyone other than the named plaintiff was a class member. Id. at 596-97.
We have not had occasion to list relevant factors that are appropriate for [253]*253district court judges to consider when determining whether joinder would be impracticable. We do so now. This non-exhaustive list includes: judicial economy, the claimants’ ability and motivation to litigate as joined plaintiffs, the financial resources of class members, the geographic dispersion of class members, the ability to identify future claimants, and whether the claims are for injunctive relief or for damages. See 5 Moore’s Federal Practice § 23.22; 5 Newberg on Class Actions § 3.12 (“These factors include: judicial economy arising from avoidance of a multiplicity of actions, geographic dispersion of class members, size of individual claims, financial resources of class members, and the ability of claimants to institute individual suits.”); Pa. Pub. Sch. Emps. Ret. Sys., 772 F.3d at 120 (“However, the numerosity inquiry is not strictly mathematical but must take into account the context of the particular case, in particular whether a class is superior to joinder based on other relevant factors including: (i) judicial economy, (ii) geographic dispersion, (iii) the financial resources of class members, (iv) their ability to sue separately,- and (v) requests for injunctive relief that would involve future class members.” (citing Robidoux, 987 F.2d at 936)).
These factors are only relevant to a binary choice at the certification stage: a class action versus joinder of all interested parties. At this point, we do not consider the possibility that plaintiffs ■ may bring individual suits. After all, the text of Rule 23(a)(1) refers to whether “the class is so numerous that joinder of all members is impracticable,”11 not whether the class is so numerous that failing to certify presents the risk of many separate lawsuits.
While all factors are relevant, we note at the outset that not all are created equal. Instead, both judicial economy and the ability to' litigate as joined parties' are of primary importance. As we have held, judicial economy is one of the purposes behind Rule 23(a)(1) and class actions in general. Marcus, 687 F.3d at 594. The same is true of ensuring that small-value claims have a mechanism by which they can be economically litigated. Id.; Deposit Guaranty Nat'l Bank, Jackson, Miss. v. Roper, 445 U.S. 326, 339, 100 S.Ct. 1166, 63 L.Ed.2d 427 (1980) (‘Where it is not economically feasible to obtain relief within the traditional framework of a multiplicity of small individual suits for damages, aggrieved persons may be without any effective redress unless they may employ the class action device.”). If we were to say that judicial economy and the ability of class members to bring their own suits as named parties weighed in favor of class [254]*254certification, how could the other factors outweigh these considerations even though the core purposes of a class action were being advanced?12 In this case, the District Court’s judicial economy analysis was incorrect, as it improperly placed great weight on the late stage of the proceeding. Additionally, the District Court did not fully explore the ability of class members to join as plaintiffs.
a. Mdicial Economy
Judicial economy, a primary factor frequently cited, looks to the administrative burden that multiple or aggregate claims place upon the courts. Marcus, 687 F.3d at 594 (stating that the numerosity requirement “also promotes judicial economy; by sparing the courts the burden of having to decide numerous, sufficiently similar individual actions seriatim”)-, id. (“Courts no. longer have to conduct a single, administratively burdensome action with all interested parties compelled to join and be present.”). This factor takes into account any efficiency considerations regarding the joinder of all interested parties that the district court deems relevant, including the number of parties and the nature of the action. See 5 Moore’s Federal Practice § 23.22 (instructing a court to consider “the actual, practical difficulties of joining all of the potential class members” by inquiring whether joinder “would be expensive, time-consuming, and logistically unfeásible”). In analyzing judicial economy, we focus on whether the class action mechanism is substantially more efficient than joinder of all parties.
Here, the District Court “conclude[d] that judicial economy [was] best served by trying this case as a class action.” King Drug Co., 309 F.R.D. at 206. It made this decision by looking to “the extensive history of the litigation and the exhaustive discovery that ha[d] been conducted.” Id. It expressed concern that further discovery would delay the case even more, or that unnamed class members would opt to file suit elsewhere, resulting in other civil actions with additional discovery and the potential for inconsistent verdicts. Id. at 206-07 (“Joinder of the absent class members would likely require additional rounds of discovery, which would only further delay a trial date. Further, if cases were brought within other jurisdictions, additional discovery is certainly a possibility.”).13 While these predictions may come true, the late stage of litigation is not by itself an appropriate consideration to take into account as part of a numerosity analysis.14
[255]*255In complex cases such as this antitrust suit, the class certification decision is often delayed until after years of fact and expert discovery have been conducted and dispos-itive motions have been litigated. See Hydrogen Peroxide, 552 F.3d at 324 (“But even with some limits on discovery and the extent of the hearing, the district judge must receive enough evidence, by affidavits, documents, or testimony, to be satisfied that each Rule 23: requirement has been met.” (quoting In re Initial Pub. Offerings Sec. Litig., 471 F.3d 24, 41 (2d Cir. 2006))). Courts routinely refuse to certify classes based on the need to conduct further discovery before being able to properly rule on a class certification motion. See In re New Motor Vehicles Canadian Export Antitrust Litig., 522 F.3d 6, 26-27 (1st Cir. 2008) (noting that the district court erred in preliminarily certifying the class because of the “novelty and complexity of the theories advanced and the gaps in the .evidence proffered”); Valley Drug Co. v. Geneva Pharm., Inc., 350 F.3d 1181, 1192 (11th Cir. 2003) (“[T]he record needed to decide [the class certification] issue remains incomplete because the district court improperly denied Abbott’s request to conduct so-called ‘downstream discovery.’”). However, such decisions do not prejudice a plaintiff; the class certification motion is not denied, but only deferred until after further discovery is conducted.
Conversely, a rule that would allow courts to consider the late stage of litigation and the sunk costs already incurred in their numerosity analyses would place a thumb on the scale in favor of a numerosity finding for no reason other than the fact that the complex nature of a case resulted in the class certification decision being deferred for years. Our-'view is consistent with the 2003 amendments ' to Rule 23(c)(1)(A) which state that the class certification decision should be made “a[t] an early practicable time after a person sues or is sued as a class representative” as opposed to the previous rule which said that the decision be made “as soon as practicable after commencement of an action.” See Fed. R. Civ. P. 23 advisory committee’s notes to 2003 amendment (noting that the “as soon as practicable” designation does not “capture[ ] the many valid reasons that mayjufetify deferring the initial certification decision”). We have recognized that the rule was modified in order to discourage “premature certification determinations.” Richardson v. Bledsoe, 829 F.3d 273, 279-80, 2016 WL 3854216, at *4 (3d Cir. July 15, 2016) (quoting Weiss v. Regal Collections, 385 F.3d 337, 347 (3d Cir. 2004)).15
As the Advisory Committee noted, there are “many valid reasons that may justify deferring the' initial certification decision,” including the need to conduct discovery, a determination of what issues would be presented at trial, and the defendant’s desire to “win dismissal or summary judgment as to the individual plaintiffs without certification and without binding the class that might have been certified.” Fed. R. Civ. P. 23 advisory committee’s notes to 2003 amendment. Thus, while Rule 23(c)(1)(A) now encourages further discovery so that all of the information, and evidence relevant to certification is before a district judge before she makes the certi[256]*256fication decision, the District Court’s analysis here would seem to consider any lengthy period following the filing of a putative class action as weighing in favor of finding numerosity. This cannot be right. Judicial economy does not permit consideration of the sunk costs from past discovery and litigation, or the need to conduct further discovery if the class is not certified.16
Moreover, while the District Court expressed concern that “[j]oinder of the absent class members would likely require additional rounds of discovery,” King Drug, 309 F.R.D. at 206, this does not mean that the , litigation would have, to begin anew for the unnamed class members, If the members all opted to join the case as individual plaintiffs, the District Court could,, in its discretion, limit discovery where “is unreasonably cumulative or duplicative, or can be obtained from some other source that is more convenient, less burdensome, or less expensive.” Fed. R. Civ. P. 26(b)(2)(C)(i). At this point, Defendants have not shown what further discovery they are entitled to; they only claim that they are entitled to further discovery as a matter of due process.17 In addition, as a class, Plaintiffs have been using the same experts. It is not clear that there would be a need for that to change merely because Plaintiffs would be joined as individual parties instead of moving forward as a class.
On remand, when considering the judicial economy factor of the numerosity analysis, the District Court should not take into account the sunk costs of the litigation or the- need to further delay trial were the class not to be certified.18 In, other words, [257]*257without considering the- late stage of the litigation, it should determine whether a class action would have been a substantially more efficient mechanism of litigating this suit than joinder of all parties. This primarily involves considerations of docket control, taking into account practicalities as simple as that of every attorney making an appearance on the record. At the same time, the District Court is free to rely on its superior understanding of how the case has proceeded to date for the purpose of determining whether the class mechanism would have actually been a substantially more efficient use of judicial resources than joinder of the parties at the onset of the litigation.
b. Ability and Motivation to be Joined as Plaintiffs
The second purpose behind the numer-osity requirement is to further the broader class , action goal of providing those with small claims reasonable access to a judicial forum for the resolution of those claims. Thus, the ability and motivation of Plaintiffs to pursue their litigation via joinder is the second factor upon which we focus. See Marcus, 687 F.3d at 594 (stating that the numerosity requirement “creates greater access to judicial relief, particularly for those persons with claims that would be uneconomical to litigate individually”).19
This primarily20 involves an examination of the stakes at issue for the individual claims and the complexity of the litigation, which will typically correlate with the costs of pursuing these claims. Though joinder is certainly more economical for most plaintiffs than pursuing the case alone, it is often still uneconomical for an individual with a negative valué claim’to join a lawsuit.21 After all, each plaintiff may need to hire his own counsel to protect his individual interests — although total litigation costs would still likely be lower due to joint litigation agreements. Similarly, each plaintiff would be subject to discovery, whereas the defendants would have to show a greater need for discovery from unnamed plaintiffs in a class action.22 See Clark v. Universal Builders, Inc., 501 F.2d 324, 340-41 (7th Cir. 1974) (placing the burden on defendants to show the need for discovery from unnamed class members to ensure that the discovery is not requested “as a tactic to take undue advantage of the class members or as a stratagem to reduce the number of claim-ánts” (internal quotation marks omitted)).
[258]*258The District Court did not properly consider this factor, as it focused instead on whether the individual plaintiffs could have brought their own, individual suits.23 However, the numerosity rule does not envision the alternative of individual suits; it considers only the alternative of joinder. Here, the class members, based on the record before us, appear likely to have the ability and incentive to bring suit as joined parties, thus preventing the alleged wrongdoers from escaping liability.24 In fact, three class members, none of whom are named plaintiffs, each have claims estimated at over $1 billion — even before the trebling of damages. These three make up over 97% of the total value of the class claims, and can hardly be considered as candidates who need the aggregative advantages of the class device. While this factor could weigh in favor of class status if the remaining class members had very small claims, that is simply not the case here. Thirteen of the other nineteen class members have claims that are greater than $1 million, the value that the two parties seem to agree is the appropriate figure at which point bringing one’s own [259]*259suit becomes economical. On the other hand, there are only six class members with claims below $1 million each. While it may be uneconomical for these claims to be pursued in individual litigation, there has been no showing that it would be uneconomical for these six class members to be individually joined as parties in a traditional lawsuit. On remand,' the District Court should consider this issue. Even if it were uneconomical for some or all of these six individual plaintiffs to join the suit, the District Court must still determine whether, considering all the other relevant factors, class status — which is “an exception to the usual rule that litigation is conducted by and on behalf of the individual named parties only,” Wal-Mart Stores, 564 U.S. at 348, 131 S.Ct. 2541 — is appropriate here.
The District Court abused its discretion in analyzing the two most important nu-merosity factors when it considered the late stage of the litigation' as relevant to the judicial economy factor and failed to properly consider the ability and motivation of the plaintiffs to proceed as joined, as opposed to individual, parties. We therefore remand for the District Court to conduct a rigorous numerosity analysis for this class of twenty-two (or twenty-five) members. In conducting this rigorous analysis, factors that the District Court may consider include the financial resources of the class members, the geographic dispersion of the class members, the ability to identify future claimants, together with the fact that these claims are for damages, and not injunctive relief.
Contrary to the dissent’s assertion, we are not “erecting roadblocks that do not exist.” Although the dissent suggests that Defendants have not yet shown why join-der is practicable, that suggestion is beside the point.' The burden is on Plaintiffs to show why joinder is impracticable. Marcus, 687 F.3d at 591 (“The party seeking certification bears the burden of establishing each element of Rule 23” — including the numerosity requirement — “by a preponderance of the evidence.”); id. at 595 (“Critically, numerosity — like all Rule 23 requirements — must be proven by a preponderance of the evidence.”). Moreover, the dissent would have Defendants’ inability to articulate an argument against finding numerosity obviate a district court’s obligation to conduct “a rigorous analysis” and determine “that the prerequisites of Rule 23(a) have been satisfied.” Wal-Mart, 564 U.S. at 351, 131 S.Ct. 2541 (internal quotation marks omitted); Hydrogen Peroxide, 552 F.3d at 310 (“[A] class may not be certified without a finding that each Rule 23 requirement is met.”).
Finally, the dissent makes the extravagant claim that “nothing about [this case] cries out for anything but class treatment.” Yet this is not the typical class action where hundreds or thousands of claims are aggregated in order to ensure that the wrongdoer is held accountable and that small claims are vindicated. See Thorogood, 547 F.3d at 744. Putting aside the small number of class members in this case, the judges in the majority have never seen a class action where three class members, each with billions of dollars at stake and close to 100% of the total value of class claims between them, have been allowed to sit on the sidelines as unnamed class members. Plaintiffs must satisfy their burden of showing why we should allow this'unique putative class to take advantage of this “exception to the usual rule that litigation is conducted by and on behalf of the individual named parties only.” Wal-Mart, 564 U.S. at 348, 131 S.Ct. 2541 (internal quotation marks omitted). At this point, they have failed to meet that burden, and any suggestion that this is a run-[260]*260of-the-mill class action ignores the facts of this case.25
B. Predominance.
Although we remand for the District Court to reconsider its numerosity analysis, we also see a need to address Defendants’ predominance argument. This argument makes selective use of language from the Supreme Court’s recent decision in Comcast Corp. v. Behrend, — U.S. -, 133 S.Ct. 1426, 185 L.Ed.2d 515 (2013). The interpretation of Comcast advanced by Defendants is overly broad and simplistic, and, if the class were to meet the numerosity requirement on remand, the predominance argument advanced by Defendants is untenable.
Under Rule 23(b)(3), “questions of law or fact common to class members [must] predominate over any questions affecting only individual members.”26 This “inquiry tests whether proposed clásses are sufficiently cohesive to warrant adjudication by representation.” Amchem Prods., 521 U.S. at 623, 117 S.Ct. 2231. “If anything, Rule 23(b)(3)’s predominance criterion is even more demanding than Rule 23(a),” Comcast, 133 S.Ct. at 1432, as it is “[f]ramed for situations in which ‘class-action treatment is not as clearly called for’ as it is in Rule 23(b)(1) and (b)(2) situations.” Amchem Prods., 521 U.S. at 615, 117 S.Ct. 2231 (quoting Fed. R. Civ. P. 23 advisory committee’s notes to 1966 amendment). This “inquiry is especially dependent upon the merits of a plaintiffs claim, since the nature of the evidence that will suffice to resolve a question determines whether the question is common or individual.” In re Constar Int'l Inc. Sec. Litig., 585 F.3d 774, 780 (3d Cir. 2009) (internal quotation marks omitted). The Supreme Court has noted that “[a]n individual question is one where members of a proposed class will need to present evidence that varies from member to member, while a common question is one where the same evidence will suffice for each member to make a prima facie showing [or] the issue is susceptible to generalized, class-wide proof.” Tyson Foods, Inc. v. Bouaphakeo, — U.S. ——, 136 S.Ct. 1036, 1045, 194 L.Ed.2d 124 (2016) (internal quotation marks omitted).
The predominance requirement applies to damages as well, because the efficiencies of the class action mechanism would be negated if “[questions of individual damage calculations ... overwhelm questions common to the class.” Comcast, 133 S.Ct. at 1433. This does not mean, however, that damages must be “susceptible of measurement across the entire, class for purposes of Rule 23(b)(3).” Neale v. Volvo Cars of N.A. LLC, 794 F.3d 353, 374 (3d Cir. 2015) (internal quotation marks omitted).
Defendants contend that Plaintiffs cannot satisfy the predominance requirement of Rule 23(b)(3). They make two interrelated arguments: (1) Plaintiffs’ theory of liability runs afoul of Comcast because, after the grant of summary judg[261]*261ment on the global conspiracy claim, Plaintiffs’ damages model no longer corresponds to their remaining theory of liability that there were four independent Section 1 conspiracies; and (2) predominance cannot be demonstrated because Plaintiffs’ remaining theory of liability must isolate the harm that each individual reverse-payment settlement agreement caused each individual class member under the doctrine of antitrust standing.27
1. Comcast Argument
Comcast was an antitrust suit brought by a class of Comcast subscribers. The plaintiffs initially had four theories of antitrust impact: (1) “Comcast’s clustering made it profitable for Comcast to withhold local sports programming from its competitors”; (2) “Comcast’s activities reduced the level of competition from ‘overbuilders’ ”; (3) “Comcast reduced the level of ‘benchmark’ competition on which cable customers rely to compare prices”; and (4) “clustering increased Comcast’s bargaining power relative to content providers.” 133 S.Ct. at 1430-31. Their damages model “did not isolate damages resulting from any one theory of antitrust impact,” id. at 1431, and simply “assumed the validity of all four theories of antitrust impact,” id. at 1434. The district court limited its certification order to the overbuilding theory because it was the only antitrust theory capable of classwide proof, but found the predominance requirement to be satisfied even though the damages model was not altered to reflect the only theory of harm remaining. Id. at 1431. A divided panel of our Court affirmed, Behrend v. Comcast Corp., 655 F.3d 182 (3d Cir. 2011), with Judge Jordan writing separately to say that he “would vacate the certification order to the extent it provides for a single class as to proof of damages,” id. at 209 (Jordan, J., concurring in the judgment in part and dissenting in part), because the model of plaintiffs’ .expert “no longer fits Plaintiffs’ sole theory of antitrust impact, and, instead, produces damages calculations that are not the certain result of the wrong,” id. at 217 (internal quotation marks omitted).
The Supreme Court reversed, holding that while the damages model does not .need to be exact, “a model purporting to serve as evidence of damages in [a] class action must measure only those damages attributable to that theory. If the model does not even attempt to do that, it cannot possibly establish that damages are susceptible of measurement across the entire class for purposes of Rule 23(b)(3).” Comcast, 133 S.Ct. at 1433. Because the plaintiffs’ damages model reflected injury from all four alleged antitrust violations, and because only the overbuilding theory of harm remained, the damages model was unable to “bridge the differences between supra-competitive prices in general and supra-competitive prices attributable to the deterrence of overbuilding.” Id. at 1435. [262]*262The Supreme Court explained “[p]rices whose level above what an expert deems ‘competitive’ has been caused by - factors unrelated to an accepted theory of antitrust harm are not ‘anticompetitive’ in any sense relevant here.” Id.
In the cáse' before us, Plaintiffs’ expert, Dr. Leitzinger, created a damages model that calculated the savings to the class if generic entry had occurred earlier. He noted the prices and overcharges actually paid by the class members and compared that to but-for worlds that included the launch of anywhere between one and five generic competitors. Crucially, this model did not allocate damages amongst the five original defendants (Cephalon and the four generic manufacturers), attribute a certain amount of harm from each individual reverse-payment settlement, or identify which class members were harmed by which reverse-payment settlement. In Defendants’ view, because only individual conspiracies remain, any damages, model must reflect the harm caused by each individual conspiracy to each individual class member, and the use of the same damages model that envisioned a global conspiracy “does not even attempt,” Id. at 1433, to correspond to this remaining theory of liability.28
However, Plaintiffs’ theory of liability is not that each individual agreement caused an individual harm, such that a new damages model would be required under Com-cast. Instead, their theory of liability is that each individual agreement contributed to the market-wide harm, and that all five original defendants are jointly and severally liable29 for this harm as concurrent tortfeasors. This theory may ultimately be proven wrong, but it does match Plaintiffs’ damages theory. Defendants next try to argue that Plaintiffs’ theory of liability must isolate the harm from each individual agreement, and that any reliance on joint and several liability conflicts with the requirements of antitrust standing.
2. Antitrust Impact
Defendants argue that Plaintiffs are attempting to circumvent the doctrine of antitrust standing by asserting the theory of joint and several liability. In essence, they are arguing that the joint and several theory of liability is not “plausible in theory,” Hydrogen Peroxide, 552 F.3d at 325, because under the doctrine' of antitrust standing Plaintiffs must show how each individual agreement harmed each individual class member.
In an antitrust class action, “impact often is critically important for the purpose of evaluating Rule 23(b)(3)’s predominance requirement because it is an element of the, claim that may call for individual, as opposed to common, proof.” Id. at 311. A district court must thus undertake a “rigorous assessment of the available evidence and the method or methods by which plaintiffs propose to use the evidence to prove impact at trial.” Id. at 312. The class should only be certified [263]*263“if such impact is plausible in theory [and] it is also susceptible- to proof at trial through available evidence common to-the class.” Id. at 325. This .inquiry often involves an overlap into the merits. Id. at 324.
Defendants argue that, in the absence of the global conspiracy claim, Plaintiffs must prove which class members suffered ah injury under a specific bilateral agreement. They state that under the doctrine of antitrust standing, á class member who would have purchased- generic modafinil from Ranbaxy cannot hold Mylan liable; a class member who would have purchased generic modafinil from Mylan cannot hold Ran-baxy liable; and a class member who would have purchased generic modafinil from Teva cannot hold either Ranbaxy or Mylan liable. If correct, such individualized inquiries would defeat predominance, and Plaintiffs’ joint and several liability theory would not be plausible. We agree with the general proposition that an antitrust plaintiff cannot defeat the doctrine of antitrust standing by resort to common-law tort principles untethered to antitrust law. But Defendants’ objection is misplaced in this case because the common law principle of joint and several liability is being invoked by Plaintiffs for the proper purpose of establishing antitrust impact and therefore antitrust standing.
The doctrine of antitrust standing requires a plaintiff to “prove more than injury causally linked to an illegal presence in the market.” Brunswick Corp. v. Pueblo Bowl-O-Mat, Inc., 429 U.S. 477, 489, 97 S.Ct. 690, 50 L.Ed.2d 701 (1977).30 This inquiry instead looks - to whether the plaintiff suffered an antitrust injury, i.e,, an “injury of the type the antitrust laws were intended to prevent and that flows from that which makes defendants’ acts unlawful.” Id. In Associated General Contractors of California, Inc. v. California State Council of Carpenters (AGC), 459 U.S. 519, 537-38, 103 S.Ct. 897, 74 L.Ed.2d 723 (1983), the Supreme Court stated that many factors go into this determination. We have condensed these factors into a multi-part test:
(1) the causal connection between the antitrust violation and the harm to the plaintiff and the intent by the defendant to cause that harm, with neither factor alone conferring standing; (2) whether the plaintiffs alleged injury is of the type for which the antitrust laws were intended to provide redress; (3) the directness of the injury, which addresses the concerns that liberal application of standing principles might produce speculative claims; (4) the existence of more direct victims of the alleged antitrust violations; and (5) the potential for dupli-cative recovery or complex apportionment of damages. ,
Ethypharm S.A. France v. Abbott Labs., 707 F.3d 223, 232-33 (3d Cir. 2013) (quoting In re Lower Lake Erie Iron Ore Antitrust Litig., 998 F.2d 1144, 1165-66 (3d Cir. 1993)). We have-said that the “directness of injury” is “the focal point by which the remainder of the ACC factors are guided.” Lower Lake Erie, 998 F.2d at 1166 n.19 (citing Holmes v. Sec. Investor [264]*264Prot. Corp., 503 U.S. 258, 269, 112 S.Ct. 1311, 117 L.Ed.2d 532 (1992)).
Defendants rely solely on a pre-AGC case of ours, Mid-West Paper Products Co. v. Continental Group, Inc., 596 F.2d 573 (3d Cir. 1979), which concerned the “all-important[ ] directness factor,” in support of their position that Plaintiffs lack antitrust standing to bring claims against generic manufacturers from whom they would not have purchased. Lower Lake Erie, 998 F.2d at 1167-68 (conducting an analysis of the AGC factors and discussing Midr-West Paper in the directness of injury section).31 In Midr-West Paper, the plaintiff claimed that it “suffered as a direct purchaser of consumer bags from competitors of the defendants, who allegedly were able to charge artificially inflated prices as a consequence of defendants’ price-fixing.” 596 F.2d at 580 (footnote omitted), We held that the plaintiff, who was not a customer of any member of the conspiracy, lacked antitrust standing to sue the conspiracy members even though it paid higher prices as a result of the conspiracy. In other words, the customer of a competitor of conspiracy members was not “one whose protection is the fundamental purpose of the antitrust laws.” Id. at 583 (internal quotation marks omitted).
In reaching this conclusion in Mid-West Paper, we took several factors into consideration. First, we rioted that it would be “almost impossible, and at the very least unwieldy” to calculate the harm to the plaintiff from the conspiracy, because so many variables went into the competitor’s price calculation irrespective of the existence of the monopoly. Id. at 584. The value of any harm caused by the anticom-petitive conduct would be speculative and “would transform this antitrust litigation into the sort of complex economic proceeding” that the direct-purchaser rule32 was adopted in part to prevent. Id. at 585. In addition, the defendants were “not in a direct or immediate relationship” to the plaintiff, and they gained no advantage from the plaintiffs injury. Id. at 583. Moreover, there was another group of victims who were more likely to sue the conspiracy members — those who purchased directly from them — and one of the purposes of the antitrust standing doctrine is to “compensate[ ] those victims who are most likely to assume -the mantle of private attorneys general for the injuries that they suffered.” Id. at 585. For that reason, we “concentrate[ ] the entire award in the hands of the direct purchasers in all but unusual circumstances and thereby giv[e] them an incentive to sue.” Id. If we were to allow the customer of a competitor to sue for treble damages when the “causal link to defendants’ activities is [so] tenuous,” it would “subject antitrust -violators to potentially ruinous liabilities, well in excess of their illegally-earned profits, because ... [violators] would be held accountable for higher prices that arguably ensued in the entire industry.” Id. at 586.
As is clear from the above description, Defendants’ argument that Mid-West Paper means that a customer of. a non-defendant cannot have antitrust standing is an oversimplification, Mid-West Paper reached its result because it wanted to [265]*265ensure that only those who are most directly harmed by the anticompetitive conduct can sue to remedy the antitrust violation. When, as in Mid-West Paper, the anticompetitive conduct is, price-fixing, the only customers who will have antitrust standing are the direct customers of the conspiracy members. The case before us is not about price-fixing. It is, instead, a case about market exclusion, as it concerns conduct that prevents a competitive market from forming at all.33 In such a scenario all market customers should have antitrust standing to sue those engaged in the allegedly anticompetitive conduct because all suffer equally from the foreclosure of choice. See AGC, 459 U.S. at 538, 103 S.Ct. 897 (“[T]he Sherman Act was enacted to assure customers the benefits of price competition, and our prior cases have emphasized the central interest in protecting the economic freedom of participants in the relevant market.”).
In fact, in Lower Lake Erie, we addressed market exclusion in the market for the unloading of iron ore from ships. Traditionally, iron ore was shipped across the Great Lakes, unloaded at railroad-owned docks onto a railroad, and then transported to the steel mills. Lower Lake Erie, 998 F.2d at 1153-55. Large cranes called “hu-letts” were affixed to the docks and were needed to unload the iron ore from the ships, and, because the non-railroad-owned docks were not equipped with huletts, they “were not competitors for this segment of the ore business.” Id. at 1153. A new, less expensive technology was developed that would allow the iron ore to be unloaded without the use of huletts, and thus open the transshipment market to non-railroad-owned docks. Id. The railroad companies suppressed this new technology by threatening non-railroad-owned docks with higher rates, among other measures. Id.
The issue of antitrust standing arose when the steel companies sued-the railroad companies for higher: rates paid to the vessel companies. Id. at 1167. We held that this injury was sufficiently direct, despite the railroad company’s reliance on .Mid-West Paper. Specifically, we noted that even though the steel companies paid higher rates than it otherwise would have to several ore transportation companies— both defendants and non-defendants — “it was unquestionably the steel companies who bore the brunt of the increased costs attributed to the railroad's agreement to thwart development of the less expensive technology.” Id. at 1168; id. (“The steel companies were the sole customers of the industry involved in the transshipment of ore; indeed, the industry existed for them.”). Although there were other victims of the harm, such as the vessel companies, the dock companies, and trucking companies, this did “not diminish the directness of the steel companies’ injury.” Id. at 1168-69.
Unlike in Mid-West Paper, where there was a market for consumer bags and we knew who was buying from whom, there was no market in this case due to Defendants’ allegedly anticompetitive conduct in delaying the availability of generic modafi-nil. Just as the railroad docks and their older, more expensive technology were the steel companies’ only choice in Lower Lake Erie, Cephalon’s brand-name version of modafinil — Provigil—was the only option available to the DPP class. All other options were prevented from entering the [266]*266market by the allegedly anticompetitive conduct of the railroad companies and the drug manufacturers, respectively.
Under Plaintiffs’ theory, each of the four generic manufacturers allegedly entered into separate anticompetitive arrangements with Cephalon.34 If any one of them had refused to enter into this arrangement, there would have been no antitrust injury for anyone, as the market would have worked as envisioned by the Hatch-Waxman Act: there would have been between one and five generic manufacturers competing with the brand-name modafinil during the 180-day exclusivity period, after which there would have been a fully competitive market. However, because all four entered into these reverse-payment settlement agreements and prevented a competitive market from forming, each contributed to the market-wide harm, and each can be held jointly and severally liable for such harm. This is not the sum of four separate individual harms emanating from each agreement; instead, it is a harm that all four agreements work jointly to produce, even if there was no conspiracy between the generic manufacturers. The class member who would have purchased from Teva is harmed by the Ranbaxy and Mylan agreements to the same extent that a Ran-baxy or Mylan customer would be. Thus, any class member would have antitrust standing to sue any or all of the four generic companies individually. There is no need to pursue an individualized inquiry into the harm caused by each agreement, and “questions of law or fact common to class members predominate over any questions affecting only individual members.” Fed. R. Civ. P. 23(b)(3). Defendants’ attempt to dictate Plaintiffs’ theory of liability based on the doctrine of antitrust standing should fail.
iy.
For the reasons stated above, we will vacate the District Court’s class certification order, and we will remand to the District Court for further consideration of whether joinder of all class members is impracticable.
Related
Cite This Page — Counsel Stack
837 F.3d 238, 95 Fed. R. Serv. 3d 1396, 2016 U.S. App. LEXIS 16723, 2016 WL 4757793, Counsel Stack Legal Research, https://law.counselstack.com/opinion/in-re-modafinil-antitrust-litigation-ca3-2016.